Document search system and document search method

US2025068669A1 · US · A1

Patent metadata
Field: Value
Publication number: US-2025068669-A1
Application number: US-202418945924-A
Country: US
Kind code: A1
Filing date: Nov 13, 2024
Priority date: May 24, 2019
Publication date: Feb 27, 2025
Grant date: (none listed)

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A document search system that enables efficient document search regardless of the ability of a user is achieved. Document search is performed using a document search system in which database document data is stored. After first document data and second document data are input to the document search system, the document search system extracts a plurality of terms from the first document data. The extraction of the terms is performed using morphological analysis, for example. Next, the extracted terms are weighted on the basis of the second document data. For example, texts included in a document represented by the second document data are classified into first and second texts. Among the terms extracted from the first document data, the weight of the term included in the first text is set larger than the weights of the other terms. The classification of the texts can be performed in accordance with a rule basis or using machine learning. After that, the similarity of the database document data to the first document data is calculated on the basis of the weighted term.
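The flow described in the abstract (extract terms, weight them using the second document, then score database documents) can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: the regex tokenizer is a stand-in for morphological analysis, the boost value of 3.0 and the weighted cosine score are assumptions, and all function names and sample texts are hypothetical. The "first text" is assumed to have already been picked out of the second document.

```python
import math
import re
from collections import Counter

# Hypothetical sample inputs, invented for illustration only.
CLAIMS = "A transistor comprising an oxide semiconductor layer and a gate electrode"
FIRST_TEXT = "The cited reference does not disclose an oxide semiconductor layer"
DATABASE = {
    "doc_a": "oxide semiconductor layer formed over a substrate",
    "doc_b": "gate electrode and wiring formed over a substrate",
}

STOP_WORDS = {"a", "an", "the", "of", "and", "to", "in", "is", "on", "for", "with"}


def extract_terms(text):
    # Stand-in for morphological analysis: lowercase word tokens minus stop words.
    return [t for t in re.findall(r"[a-z0-9]+", text.lower()) if t not in STOP_WORDS]


def weight_terms(query_terms, first_text, boost=3.0):
    # Terms that also occur in the first text get the larger weight (assumed value).
    emphasized = set(extract_terms(first_text))
    return {t: (boost if t in emphasized else 1.0) for t in set(query_terms)}


def similarity(weights, doc_text):
    # Weighted cosine between the query weight vector and the document term counts.
    counts = Counter(extract_terms(doc_text))
    dot = sum(w * counts[t] for t, w in weights.items())
    q_norm = math.sqrt(sum(w * w for w in weights.values()))
    d_norm = math.sqrt(sum(c * c for c in counts.values()))
    return dot / (q_norm * d_norm) if q_norm and d_norm else 0.0


def rank(claims, first_text, database):
    # Score every database document and return (score, name) pairs, best first.
    weights = weight_terms(extract_terms(claims), first_text)
    scored = [(similarity(weights, text), name) for name, text in database.items()]
    return sorted(scored, reverse=True)
```

Calling `rank(CLAIMS, FIRST_TEXT, DATABASE)` places `doc_a` first, because the terms it shares with the claims ("oxide", "semiconductor", "layer") also appear in the first text and therefore carry the larger weight.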

First claim

Opening claim text (preview).

1. A semiconductor device having instructions stored thereon which, when executed by one or more processors, cause the one or more processors to perform operations for searching documents, the operations comprising: receiving first document data and second document data; extracting a plurality of terms from the first document data; weighting at least one of the plurality of terms on a basis of the second document data; calculating a similarity of the database document data to the first document data on a basis of the at least one weighted term; and outputting the calculated similarity, wherein, after the plurality of terms are extracted, texts included in a document represented by the second document data are classified using machine learning, wherein the first document data represents a scope of claims of a patent application, wherein the second document data represents a written opinion against a reason for refusal of the patent application, and wherein the second document data represents a document which includes content described in a document represented by the first document data.

2. The semiconductor device according to claim 1, further the operations comprising: classifying texts included in the document represented by the second document data into a first text and a second text, and setting a weight of the term included in the first text larger than a weight of the term not included in the first text among the terms extracted from the first document data.

3. The semiconductor device according to claim 2, further the operations comprising: performing machine learning, and performing the classification of texts on the basis of a learning result of the machine learning.

4. The semiconductor device according to claim 3, further the operations comprising: inputting first learning document data; and performing the machine learning so that output data becomes closer to second learning document data, wherein the first learning document data is the same kind of document data as the second document data, and wherein the second learning document data is document data obtained by labeling the first learning document data.

5. The semiconductor device according to claim 1, further the operations comprising: extracting the plurality of terms using morphological analysis.
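The text classification in claims 2 to 4 (sorting sentences of the second document into a first text and a second text, using a model trained on labeled examples) might look something like the sketch below. The claims do not name a model, so the multinomial Naive Bayes classifier with add-one smoothing is a swapped-in assumption chosen for brevity. The labels "first" and "second", the function names, and the training sentences are all hypothetical.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical labeled sentences standing in for the "second learning document data".
TRAINING = [
    ("The cited reference does not disclose the oxide semiconductor layer", "first"),
    ("Claim 1 differs from the cited reference in the gate structure", "first"),
    ("We respectfully request reconsideration of the application", "second"),
    ("Thank you for the careful examination of the application", "second"),
]


def tokens(sentence):
    # Simple lowercase word tokens; a real system might use morphological analysis.
    return re.findall(r"[a-z0-9]+", sentence.lower())


def train(labeled_sentences):
    # Count words per label and sentences per label.
    word_counts = defaultdict(Counter)
    label_counts = Counter()
    for sentence, label in labeled_sentences:
        label_counts[label] += 1
        word_counts[label].update(tokens(sentence))
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab


def classify(sentence, model):
    # Pick the label with the highest log-probability under Naive Bayes.
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in label_counts:
        score = math.log(label_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for w in tokens(sentence):
            score += math.log((word_counts[label][w] + 1) / denom)  # add-one smoothing
        if score > best_score:
            best_label, best_score = label, score
    return best_label
```

After `model = train(TRAINING)`, sentences classified as "first" would supply the terms that receive the larger weight in the similarity calculation. With only four training sentences this is a toy; it shows the shape of the supervised step, not a usable classifier.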

Assignees

  • Semiconductor Energy Lab

Inventors

Classifications

  • Morphological analysis · CPC title

  • Document management systems · CPC title

  • Machine learning · CPC title

  • Selection or weighting of terms from queries, including natural language queries · CPC title

  • Parsing · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US2025068669A1 cover?
It covers a document search system that extracts terms from first document data (for example, the claims of a patent application), weights those terms on the basis of second document data (for example, a written opinion responding to a reason for refusal), and calculates the similarity of stored database documents to the first document data using the weighted terms. See the Abstract section for the full text.
Who is the assignee on this patent?
Semiconductor Energy Lab
What technology area does this patent fall under?
Primary CPC classification G06F16/35. Mapped technology areas include Physics.
When was this patent published?
Publication date: Feb 27, 2025 (kind code A1). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 8 related publications on this page (citations in our corpus or others sharing the same primary CPC).