System and method for detection of, prevention of, and recovery from software execution failure

US11113147B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-11113147-B2
Application numberUS-202017060462-A
CountryUS
Kind codeB2
Filing dateOct 1, 2020
Priority dateApr 27, 2018
Publication dateSep 7, 2021
Grant dateSep 7, 2021

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

Systems and methods are disclosed herein for monitoring, detecting, and mitigating hardware and software failures. An error detection module monitors the execution of software processes and detects failures of the monitored processes. The error detection module may monitor reboot events and correlate reboot events with failures of the monitored software processes. If a monitored process fails, the error detection module may log the failure and its cause. If the same process has failed numerous times, causing the user device to experience a reboot loop, remedial action may be taken based on the cause of the failure.

First claim

Opening claim text (preview).

What is claimed is: 1. A method for handling software execution failures on a user device, the method comprising: monitoring execution of a software process on the user device; detecting a reboot of the user device; determining whether the reboot was caused by a prerequisite process not having been completed; and in response to determining that the cause of the reboot is a prerequisite process not having been completed, slowing a speed at which the software process is executed to allow the prerequisite process to complete prior to execution of the software process. 2. The method of claim 1 , further comprising: incrementing a counter; and determining whether the value of the counter exceeds a failure threshold; wherein slowing the speed at which the software process is executed occurs further in response to determining that the value of the counter exceeds the failure threshold. 3. The method of claim 2 , wherein the counter is a reboot counter. 4. The method of claim 2 , wherein the counter is one of a plurality of counters, each counter being associated with a respective cause of a reboot, and wherein incrementing the counter further comprises identifying a respective counter of the plurality of counters associated with the prerequisite process. 5. The method of claim 1 , further comprising: detecting, within a threshold period of time after the reboot, a second reboot of the user device; determining whether the second reboot was also caused by the prerequisite process not having been completed. 6. The method of claim 5 , further comprising: executing a limited boot process; and transmitting, to a server, a request for a software update. 7. The method of claim 1 , wherein the software process comprises a plurality of steps, and wherein slowing the speed at which the software process is executed comprises pausing execution of the software process for a period of time after each step. 8. The method of claim 1 , wherein the software process comprises a plurality of steps, the method further comprising: identifying a step of the plurality of steps at which the reboot occurred; and monitoring execution of the prerequisite process; wherein slowing the speed at which the software process is executed comprises preventing execution of the identified step until the prerequisite process has been completed. 9. The method of claim 1 , wherein the software process stores status information related to the execution and failure of the software process in a data structure. 10. The method of claim 9 , wherein determining the cause of the reboot further comprises: determining a time at which the reboot occurred; accessing the data structure; determining status information having a timestamp at or near the determined time at which the reboot occurred; and determining, based on the status information, the cause of the reboot. 11. A system for handling software execution failures on a user device, the system comprising: memory; and control circuitry configured to: monitor execution of a software process on the user device; detect a reboot of the user device; determine whether the reboot was caused by a prerequisite process not having been completed; and in response to determining that the cause of the reboot is a prerequisite process not having been completed, slow a speed at which the software process is executed to allow the prerequisite process to complete prior to execution of the software process. 12. The system of claim 11 , wherein the control circuitry is further configured to: increment a counter; and determine whether the value of the counter exceeds a failure threshold; slow the speed at which the software process in response to determining that the value of the counter exceeds the failure threshold. 13. The system of claim 12 , wherein the counter is a reboot counter. 14. The system of claim 12 , wherein the counter is one of a plurality of counters, each counter being associated with a respective cause of a reboot, and wherein the control circuitry configured to increment the counter is further configured to identify a respective counter of the plurality of counters associated with the prerequisite process. 15. The system of claim 11 , wherein the control circuitry is further configured to: detect, within a threshold period of time after the reboot, a second reboot of the user device; determine whether the second reboot was also caused by the prerequisite process not having been completed. 16. The system of claim 15 , wherein the control circuitry is further configured to: execute a limited boot process; and transmit, to a server, a request for a software update. 17. The system of claim 15 , wherein the software process comprises a plurality of steps, and wherein the control circuitry configured to slow the speed at which the software process is executed is further configured to pause execution of the software process for a period of time after each step. 18. The system of claim 15 , wherein the software process comprises a plurality of steps, and wherein the control circuitry is further configured to: identify a step of the plurality of steps at which the reboot occurred; monitor execution of the prerequisite process; and slow the speed at which the software process is executed by preventing execution of the identified step until the prerequisite process has been completed. 19. The system of claim 15 , wherein the software process stores, in the memory, status information related to the execution and failure of the software process in a data structure. 20. The system of claim 19 , wherein the control circuitry configured to determine the cause of the reboot is further configured to: determine a time at which the reboot occurred; access, from the memory, the data structure; determine status information having a timestamp at or near the determined time at which the reboot occurred; and determine, based on the status information, the cause of the reboot.

Assignees

Inventors

Classifications

  • G06F11/076Primary

    by exceeding a count or rate limit, e.g. word- or bit count limit · CPC title

  • where the computing system component is a software system · CPC title

  • Boot up procedures · CPC title

  • Remedial or corrective actions (recovery from an exception in an instruction pipeline G06F9/3861; by retry G06F11/1402; for recovering from a failure of a protocol instance or entity H04L69/40) · CPC title

  • Root cause analysis, i.e. error or fault diagnosis (in a hardware test environment G06F11/22; in a software test environment G06F11/36) · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US11113147B2 cover?
Systems and methods are disclosed herein for monitoring, detecting, and mitigating hardware and software failures. An error detection module monitors the execution of software processes and detects failures of the monitored processes. The error detection module may monitor reboot events and correlate reboot events with failures of the monitored software processes. If a monitored process fails, …
Who is the assignee on this patent?
Rovi Guides Inc
What technology area does this patent fall under?
Primary CPC classification G06F11/076. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Sep 07 2021 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 2 related publications on this page (citations in our corpus or others sharing the same primary CPC).