Who is the assignee on this patent?

Unlikely Artificial Intelligence Ltd

What technology area does this patent fall under?

Primary CPC classification G06F40/30. Mapped technology areas include Physics.

When was this patent published?

Publication date Tue Jun 11 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.

What related patents are in patentsdb?

We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).

Computer implemented methods for the automated analysis or use of data, including use of a large language model

US12008333B2 · US · B2

Patent metadata
Field	Value
Publication number	US-12008333-B2
Application number	US-202318515488-A
Country	US
Kind code	B2
Filing date	Nov 21, 2023
Priority date	Aug 24, 2021
Publication date	Jun 11, 2024
Grant date	Jun 11, 2024

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

Title
What the patent document calls the invention.
Abstract
A short plain-language summary of the technical disclosure.
Assignees and inventors
Who owns or filed the patent and who is credited as inventor.
Key dates
Filing, priority, publication, and grant dates set the timeline.
First independent claim
The legal scope of protection — read this for what is actually claimed.
CPC / IPC classifications
Technology tags used to group this patent with similar filings.
Citations and related patents
Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Methods are provided, such as a method of interacting with a large language model (LLM), including the step of a processing system using a structured, machine-readable representation of data that conforms to a machine-readable language, such as a universal language, to provide new context data for the LLM, in order to improve the output, such as continuation text output, generated by the LLM in response to a prompt; and such as a method of interacting with a LLM, including the step of providing continuation data generated by the LLM to a processing system that uses a structured, machine-readable representation of data that conforms to a machine-readable language, such as a universal language, in which the processing system is configured to analyse the continuation output generated by the LLM in response to a prompt to enable an improved version of that continuation output to be provided to a user. Related computer systems are provided.

First claim

Opening claim text (preview).

The invention claimed is: 1. A computer-implemented method of automatically removing hallucinations from natural language text generated by a large language model (LLM), including the steps of: (a) providing a prompt or query to the LLM; (b) automatically generating a baseline response to the prompt or query, the baseline response including factual assertions; (c) automatically generating one or more verification questions to test the accuracy of one or more of the factual assertions for factual accuracy or inaccuracy; (d) systematically answering the or each verification questions in a manner that is not dependent on the baseline response; (e) using the answers to the or each verification question to identify one or more factual inaccuracies or hallucinations present in the baseline response; (f) automatically using the or each answer to the verification question or questions to generate a final natural language output, in which one or more factual inaccuracies or hallucinations present in the baseline response, have been removed. 2. The method of claim 1 in which there are multiple verification questions forming a sequence or chain of verification. 3. The method of claim 1 in which one or more of the verification questions do not include phrasing that matches the prompt or the baseline response. 4. The method of claim 1 in which one or more of the verification questions are simpler than the original prompt. 5. The method of claim 1 in which one or more of the verification questions are answered independently. 6. The method of claim 5 in which one or more of the verification questions do not include phrasing that matches the prompt or query, or the baseline response. 7. The method of claim 1 in which one or more of the verification questions are automatically answered as either true or false by the LLM. 8. The method of claim 1 in which one or more of the verification questions are open verification questions. 9. The method of claim 1 in which one or more of the verification questions are generated by the LLM. 10. The method of claim 9 in which the verification questions are generated by the LLM in response to a prompt. 11. The method of claim 1 in which the method includes the step of using explicit reasoning steps when cross-checking whether each of the answers to the verification questions indicate an inconsistency with the baseline response. 12. The method of claim 1 including the step of extracting from the baseline response a collection of one or more factual assertions by providing the LLM with the baseline response and a prompt asking the LLM to provide a continuation generating the factual assertions. 13. The method of claim 12 where a factual assertion derived from the baseline response is automatically translated into a verification question. 14. The method of claim 1 where, if a verification question is answered as false, then the related factual assertions derived from the draft natural language text is treated as false and is not included in the final natural language output. 15. The method of claim 1 where the sources of any previous known knowledge used to validate a factual assertion as false are returned as part of the process of checking one or more of the factual assertions for factual accuracy or inaccuracy. 16. The method of claim 1 where the checking the one or more factual assertions for factual accuracy or inaccuracy includes a further step of generating a natural language explanation of why a factual assertion is determined to be true or false. 17. The method of claim 1 in which the method includes the step of supplementing the information stored in the weights of the LLM by referencing a semantic representation of knowledge. 18. The method of claim 17 in which the semantic representation of knowledge is used to provide additional context to the LLM. 19. The method of claim 1 in which the method includes the step of supplementing the information stored in the weights of the LLM by referencing dynamic real-time information. 20. The method of claim 1 which includes the step of providing a reasoning system able to reason with a symbolic representation of the world, translating the verification questions or assertions into a representation compatible with the reasoning system and utilising the reasoning system to identify assertions which are true or false. 21. The method of claim 20 in which the reasoning system uses a universal language. 22. A computer implemented system configured to automatically remove hallucinations from natural language text generated by a LLM, the system being configured to: (a) receive a prompt or query to the LLM; (b) automatically generate a baseline response to the prompt or query, the baseline response including factual assertions; (c) automatically generate one or more verification questions to test the accuracy of one or more of the factual assertions for factual accuracy or inaccuracy; (d) systematically answer the or each verification questions in a manner that is not dependent on the baseline response; (e) use the answers to the or each verification question to identify one or more factual inaccuracies or hallucinations present in the baseline response; (f) automatically use the or each answer to the verification question or questions to generate a final natural language output, in which one or more factual inaccuracies or hallucinations present in the baseline response, have been removed. 23. A large language model system configured to automatically remove hallucinations from natural language text generated by the LLM system, the LLM system being configured to: (a) receive a prompt or query; (b) automatically generate a baseline response to the prompt or query, the baseline response including factual assertions; (c) automatically generate one or more verification questions to test the accuracy of one or more of the factual assertions for factual accuracy or inaccuracy; (d) systematically answer the or each verification questions in a manner that is not dependent on the baseline response; (e) use the answers to the or each verification question to identify one or more factual inaccuracies or hallucinations present in the baseline response; (f) automatically use the or each answer to the verification question or questions to generate a final natural language output, in which one or more factual inaccuracies or hallucinations present in the baseline response, have been removed.

Assignees

Unlikely Artificial Intelligence Ltd

Inventors

Classifications

G06F40/30Primary
Semantic analysis · CPC title
G06F40/205
Parsing · CPC title
G06N3/0895
Weakly supervised learning, e.g. semi-supervised or self-supervised learning · CPC title
G06N3/0442
characterised by memory or gating, e.g. long short-term memory [LSTM] or gated recurrent units [GRU] · CPC title
G06F40/56Primary
Natural language generation · CPC title

Patent family

Related publications grouped by family.

View patent family 88195841

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12008333B2 cover?: Methods are provided, such as a method of interacting with a large language model (LLM), including the step of a processing system using a structured, machine-readable representation of data that conforms to a machine-readable language, such as a universal language, to provide new context data for the LLM, in order to improve the output, such as continuation text output, generated by the LLM in…
Who is the assignee on this patent?: Unlikely Artificial Intelligence Ltd
What technology area does this patent fall under?: Primary CPC classification G06F40/30. Mapped technology areas include Physics.
When was this patent published?: Publication date Tue Jun 11 2024 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?: We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).