Model-based robust deep learning
US-2022101627-A1 · Mar 31, 2022 · US
US12427661B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12427661-B2 |
| Application number | US-202117474454-A |
| Country | US |
| Kind code | B2 |
| Filing date | Sep 14, 2021 |
| Priority date | Sep 14, 2021 |
| Publication date | Sep 30, 2025 |
| Grant date | Sep 30, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
Provided is a process, including: obtaining, with a computer system, access to a specification indicating which regions of an embedding space are designated as anomalous relative to vectors in the embedding space characterizing past behavior of a first instance of a dynamical system; receiving, with the computer system, multi-channel input indicative of a state of a second instance of the dynamical system; and classifying, with the computer system, whether the state of the second instance of the dynamical system is anomalous by: encoding the multi-channel input into a vector in the embedding space; causing the specification to be applied to the vector; obtaining a result of applying the specification to the vector; and classifying whether the state of the second instance of the dynamical system is anomalous based on the result; and storing the classification in memory.
Opening claim text (preview).
What is claimed is: 1. A system for detecting anomalous robot states, the system comprising: a robot having a robot body with joints, the robot comprising one or more processors and computer program instructions that, when executed, cause the one or more processors to perform operations comprising: determining a sequence of embeddings, wherein each embedding encodes a time-slice of a plurality of channels of sensor data indicative of a state of the robot and an environment of the robot in a lower-dimensional space than a dimensionality of the sensor data, the state of the robot comprising joint states of the robot, the lower-dimensional space having more than 10 dimensions, wherein the sequence of embeddings is associated with a specified task that the robot has been instructed to complete; determining, based on the sequence of embeddings and based on a reinforcement learning model trained for performing the specified task, an action of the specified task for the robot to perform; causing the robot to perform the action; obtaining sensor data of the robot, wherein the sensor data comprises image data generated by a camera of the robot, and wherein the sensor data further comprises an indication of joint states of the robot; generating, based on inputting the sensor data into an embedding model, a first embedding indicating a current state of the robot, wherein the current state of the robot comprises current joint states of the robot; determining, with an anomaly detection model, based on the first embedding, that the first embedding corresponds to an anomaly in the current state of the robot, wherein the anomaly detection model is trained on a set of sequences of embeddings in an embedding space to identify an anomalous area or a non-anomalous area of the embedding space, wherein each sequence of embeddings in the set of sequences of embeddings corresponds to different movements of joints of a robot instance while the robot instance performed a task, and wherein each embedding in the sequence of embeddings encodes a plurality of channels of sensor data of the corresponding robot instance in the embedding space; and in response to determining that the first embedding corresponds to an anomaly in the current state of the robot, preventing the robot from performing additional actions associated with the specified task. 2. The system of claim 1 , wherein the anomaly detection model comprises a clustering model, and wherein determining that the first embedding corresponds to an anomaly comprises: determining that the first embedding does not correspond to any cluster of a plurality of clusters, wherein each cluster of the plurality of clusters comprises embeddings corresponding to tasks that the robot has been trained to complete. 3. The system of claim 1 , wherein determining that the first embedding corresponds to an anomaly comprises: determining that the first space embedding is not contained within a multidimensional volume corresponding to a bounding envelope in embedding space observed during training. 4. The system of claim 1 , wherein determining that the first embedding corresponds to an anomaly in the current state of the robot comprises: generating a similarity score by using a distance metric to compare the first embedding with a second embedding of the sequence of embeddings; determining that the similarity score is lower than a threshold similarity score; and in response to determining that the similarity score is lower than a threshold similarity score, classifying the first embedding as an anomaly. 5. The system of claim 1 , wherein determining that the first embedding corresponds to an anomaly in the current state of the robot comprises: generating, based on inputting the first embedding into the anomaly detection model, a score indicative of whether the first embedding is an anomaly; and based on determining that the score fails to satisfy a threshold, causing the robot to continue to perform the specified task. 6. The system of claim 1 , wherein the instructions, when executed, effectuate operations further comprising: generating, based on inputting the first embedding into the anomaly detection model, a score indicative of whether the first embedding is an anomaly; and based on determining that the score fails to satisfy a threshold, causing the robot to move to a charging station associated with the robot. 7. The system of claim 1 , wherein the instructions, when executed, effectuate operations further comprising: in response to determining that the first embedding corresponds to an anomaly, sending an alert to a server, wherein the alert indicates that a teleoperator should take control over the robot. 8. The system of claim 1 , wherein the sensor data comprises information obtained from a motor position sensor of the robot, a touch sensor located in a finger of the robot, a motor current sensor of the robot, and a depth camera of the robot. 9. The system of claim 1 , further comprising: adjusting a first weight of the embedding model based on the action determined by the reinforcement learning model; and adjusting a second weight of the reinforcement learning model based on the first embedding generated via the reinforcement learning model. 10. The system of claim 1 , further comprising: in response to determining that the first embedding corresponds to an anomaly in the current state of the robot, adjusting a weight of the reinforcement learning model, wherein adjusting the weight of the reinforcement learning model reduces a likelihood of the robot performing an action associated with an anomalous embedding. 11. The system of claim 1 , wherein a time of less than 100 milliseconds transpires between determining that the first embedding corresponds to an anomaly in the current state of the robot and preventing the robot from performing further actions associated with the specified task. 12. The system of claim 1 , wherein the system comprises a server and the server is configured to perform operations comprising: receiving data from a plurality of robots, wherein the plurality of robots comprises the robot; generating an updated anomaly detection model by training, based on the data, the anomaly detection model; and sending the updated anomaly detection model to the robot, wherein the data comprises embeddings generated by each robot of the plurality of robots and sensor data from each robot of the plurality of robots. 13. The system of claim 1 , wherein the system comprises a server and the instructions, when executed, effectuate operations further comprising: in response to determining that the first embedding corresponds to an anomaly, sending a query to the server, wherein the query comprises the first embedding; in response to sending the query to the server, receiving a second sequence of embeddings; and causing the robot to perform actions corresponding to the second sequence of embeddings. 14. The system of claim 1 , wherein the system comprises a server and the server is configured to perform operations comprising: in response to receiving a query associated with the first embedding, determining, based on the first embedding, an action for the robot to perform; and sending, to the robot, instructions for performing the action, wherein determining an action for the robot to perform comprises: comparing the first embedding with a plurality of embeddings, wherein each embedding of the plurality of embeddings corresponds to an anomaly, and wherein each embedding of the plurality of embeddings is associated with a sequence of embeddings generated from actions performed by a
Clustering techniques · CPC title
Dual arm manipulator; Coordination of several manipulators · CPC title
including video camera means · CPC title
Teleoperation · CPC title
learning, adaptive, model based, rule based expert control · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.