Supplementing user perception and experience with augmented reality (AR) and artificial intelligence (AI) techniques utilizing an artificial intelligence (AI) agent

US12518491B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12518491-B2
Application numberUS-202318097040-A
CountryUS
Kind codeB2
Filing dateJan 13, 2023
Priority dateJan 13, 2023
Publication dateJan 6, 2026
Grant dateJan 6, 2026

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

According to examples, a system for supplementing user perception and experience via augmented reality (AR), artificial intelligence (AI), and machine-learning (ML) techniques is described. The system may include a processor and a memory storing instructions. The processor, when executing the instructions, may cause the system to receive data associated with at least one of a location, context, or setting and determine, using at least one artificial intelligence (AI) model and at least one machine learning (ML) model, relationships between objects in the at least one of the location, context, or setting. The processor, when executing the instructions, may then apply an artificial intelligence (AI) agent to analyze the relationships and generate a three-dimensional (3D) mapping of the at least one of the location, context, or setting and provide an output to aid a user's perception and experience.

First claim

Opening claim text (preview).

The invention claimed is: 1 . A system, comprising: a processor; and a memory storing instructions, which when executed by the processor, cause the processor to: determine, using at least one artificial intelligence (AI) model, relationships between objects in a field of view of a camera of a headset worn by a user; apply an artificial intelligence (AI) agent associated with the user to analyze the relationships determined by the at least one artificial intelligence (AI) model and generate a three-dimensional (3D) mapping that includes one or more objects within the field of view, wherein the artificial intelligence (AI) agent is personalized to the user based on previous user interactions with the artificial intelligence (AI) agent; and provide an output via at least one output modality of a plurality of output modalities, the at least one output modality determined by the artificial intelligence (AI) agent, to aid a user's perception and experience of the field of view, wherein the output is based on the three-dimensional (3D) mapping and includes a prediction, made by the artificial intelligence (AI) agent, for a user activity that is associated with the field of view. 2 . The system of claim 1 , wherein the instructions, when executed by the processor, further cause the processor to implement the artificial intelligence (AI) agent to conduct a localization and mapping analysis to generate the three-dimensional (3D) mapping. 3 . The system of claim 1 , wherein the headset is a pair of augmented reality (AR) glasses. 4 . The system of claim 1 , wherein the at least one artificial intelligence (AI) model comprises at least one of a large language model (LLM), a generative adversarial network (GAN), a tree-based model, a Bayesian network, a support vector, clustering, a kernel method, a spline, or a knowledge graph. 5 . The system of claim 1 , wherein generating the three-dimensional (3D) mapping includes: determining a location of each of the one or more objects relative to a location of the user based on, at least, the field of view. 6 . The system of claim 1 , wherein the instructions, when executed by the processor, further cause the processor to: receive image data of the field of view from the camera of the headset; and perform image analysis of the image data, wherein the determining the relationships between objects in the field of view is based on the image data. 7 . The system of claim 1 , wherein the instructions, when executed by the processor, further cause the processor to: determine a risk associated with the one or more objects, wherein the output is further based on the risk. 8 . A method for supplementing user perception and experience, comprising: determining, using at least one artificial intelligence (AI) model, relationships between objects in a field of view of a camera of a headset worn by a user; applying an artificial intelligence (AI) agent associated with the user to analyze the relationships determined by the at least one artificial intelligence (AI) model and generating a three-dimensional (3D) mapping that includes one or more objects within the field of view, wherein the artificial intelligence (AI) agent is personalized to the user based on previous user interactions with the artificial intelligence (AI) agent; and providing an output via at least one output modality of a plurality of output modalities, the at least one output modality determined by the artificial intelligence (AI) agent, to aid a user's perception and experience of the field of view wherein the output is based on the three-dimensional (3D) mapping and includes a prediction, made by the artificial intelligence (AI) agent, for a user activity that is associated with the field of view. 9 . The method of claim 8 , further comprising implementing the artificial intelligence (AI) agent to conduct a localization and mapping analysis to generate the three-dimensional (3D) mapping. 10 . The method of claim 8 , wherein the headset is a pair of augmented reality (AR) glasses. 11 . The method of claim 8 , wherein the at least one artificial intelligence (AI) model comprises at least one of a large language model (LLM), a generative adversarial network (GAN), a tree-based model, a Bayesian network, a support vector, clustering, a kernel method, a spline, or a knowledge graph. 12 . The method of claim 8 , wherein generating the three-dimensional (3D) mapping includes: determining a location of each of the one or more objects relative to a location of the user based on, at least, the field of view. 13 . The method of claim 8 , further comprising: receiving image data of the field of view from the camera of the headset; and performing image analysis of the image data, wherein the determining the relationships between objects in the field of view is based on the image data. 14 . A non-transitory computer-readable storage medium having an executable stored thereon, which when executed instructs a processor to: determine, using at least one artificial intelligence (AI) model, relationships between objects in a field of view of a camera of a headset worn by a user; apply an artificial intelligence (AI) agent associated with the user to analyze the relationships determined by the at least one artificial intelligence (AI) model and generate a three-dimensional (3D) mapping that includes one or more objects within the field of view, wherein the artificial intelligence (AI) agent is personalized to the user based on previous user interactions with the artificial intelligence (AI) agent; and provide an output via at least one output modality of a plurality of output modalities, the at least one output modality determined by the artificial intelligence (AI) agent, to aid a user's perception and experience of the field of view, wherein the output is based on the three-dimensional (3D) mapping and includes a prediction, made by the artificial intelligence (AI) agent, for a user activity that is associated with the field of view. 15 . The non-transitory computer-readable storage medium of claim 14 , wherein the executable when executed further instructs the processor to implement the artificial intelligence (AI) agent to conduct a localization and mapping analysis to generate the three-dimensional (3D) mapping. 16 . The non-transitory computer-readable storage medium of claim 14 , wherein generating the three-dimensional (3D) mapping includes: determining a location of each of the one or more objects relative to a location of the user based on, at least, the field of view. 17 . The non-transitory computer-readable storage medium of claim 14 , wherein the executable when executed further instructs the processor to: receive image data of the field of view from the camera of the headset; and perform image analysis of the image data, wherein the determining the relationships between objects in the field of view is based on the image data. 18 . The non-transitory computer-readable storage medium of claim 14 , wherein the executable when executed further instructs the processor to: determine a risk associated with the one or more objects, wherein the output is further based on the risk. 19 . The non-transitory computer-readable storage medium of claim 14 , wherein the headset is a pair of augmented reality (AR) glasses. 20 . The non-transitory computer-readable storage medium of claim 14 , wherein the at least one artificial intelligence (AI) model comprises at least one of a large lan

Assignees

Inventors

Classifications

  • using probabilistic graphical models from image or video features, e.g. Markov models or Bayesian networks · CPC title

  • characterised by optical features · CPC title

  • Eyeglass type (eyeglass details G02C) · CPC title

  • using neural networks · CPC title

  • G06T19/006Primary

    Mixed reality (object pose determination, tracking or camera calibration for mixed reality G06T7/00) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12518491B2 cover?
According to examples, a system for supplementing user perception and experience via augmented reality (AR), artificial intelligence (AI), and machine-learning (ML) techniques is described. The system may include a processor and a memory storing instructions. The processor, when executing the instructions, may cause the system to receive data associated with at least one of a location, context,…
Who is the assignee on this patent?
Meta Platforms Inc
What technology area does this patent fall under?
Primary CPC classification G02B27/0172. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Jan 06 2026 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 4 related publications on this page (citations in our corpus or others sharing the same primary CPC).