Method, system and storage medium of resilient human-on-the-loop range-only cooperative positioning of plurality of unmanned aerial vehicles

US12504772B2 · US · B2

Patent metadata
FieldValue
Publication numberUS-12504772-B2
Application numberUS-202318494293-A
CountryUS
Kind codeB2
Filing dateOct 25, 2023
Priority dateJan 19, 2022
Publication dateDec 23, 2025
Grant dateDec 23, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

The present disclosure provides a method, a system and a storage medium of resilient human-on-the-loop range-only cooperative positioning of a plurality of unmanned aerial vehicles (UAVs). The method includes computing an initial exploitability using an initial distribution and an initial policy; performing a forward updating of a distribution of a portion of the plurality of UAVs, and performing a backward updating of a Q function of each UAV of the plurality of UAVs; for each time step, calculating a dual variable at an (i+1)-th iteration and calculating a policy at an (i+1)-th iteration; computing a ratio of an exploitability at the (i+1)-th iteration over the initial exploitability; and if the ratio is less than or equal to a pre-defined tolerance value, maintaining a policy at the i-th iteration; and if the ratio is greater than the pre-defined tolerance value, using the policy at the (i+1)-th iteration.

First claim

Opening claim text (preview).

What is claimed is: 1 . A method of resilient human-on-the-loop range-only cooperative positioning of a plurality of unmanned aerial vehicles (UAVs), comprising: determining an initial distribution, an initial policy, and a pre-defined tolerance value by a processer; computing an initial exploitability using the initial distribution and the initial policy; for an i-th iteration, for each time step, performing a forward updating of a distribution of a portion of the plurality of UAVs by computing a distribution of the portion of the plurality of UAVs at a k-th time step using a distribution of the portion of the plurality of UAVs at a (k−1)-th time step under a policy at the (k−1)-th time step; and performing a backward updating of a Q function of each UAV of the plurality of UAVs by computing a Q function value of each UAV of the plurality of UAVs at the k-th time step using a Q function value of each UAV of the plurality of UAVs at a (k+1)-th time step; for each time step, calculating a dual variable at an (i+1)-th iteration using a dual variable at the i-th iteration and the computed Q function value of each UAV of the plurality of UAVs at the i-th iteration; and calculating a policy at the (i+1)-th iteration using the calculated dual variable at the (i+1)-th iteration; computing an exploitability at the (i+1)-th iteration and a ratio of the exploitability at the (i+1)-th iteration over the initial exploitability; and when the ratio is less than or equal to the pre-defined tolerance value, controlling the UAV movement by maintaining a policy at the i-th iteration for each UAV of the plurality of UAVs by maintaining a policy at the i-th iteration; and when the ratio is greater than the pre-defined tolerance value, using the policy at the (i+1)-th iteration for controlling the movement of each UAV of the plurality of UAVs using the policy at the (i+1)-th iteration. 2 . The method according to claim 1 , wherein the dual variable at the (i+1)-th iteration is calculated by: y k i + 1 ( s _ , μ ) = y k i ( s _ , μ ) + α ⁢ Q π k i ( s _ , μ ) wherein γ denotes the dual variable, s denotes a UAV failure probability, μ denotes a recovery rate, α denotes a step size, Q denotes a Q-function, and π denotes a policy. 3 . The method according to claim 2 , wherein the policy at the (i+1)-th iteration is calculated by: π k i + 1 ( · ❘ s _ ) = Γ ⁡ ( y k i + 1 ( s _ , μ ) ) wherein Γ denotes a function that maps the dual variable at the (i+1)-th iteration to the policy at the (i+1)-th iteration. 4 . The method according to claim 1 , further including: inputting a plurality of discretized states and a plurality of discretized actions into a mean field game (MFG) model. 5 . The method according to claim 4 , wherein: a ratio of an exploitability at the (i+1)-th iteration to the initial exploitability is configured to determine early stopping of the mean field game model. 6 . A system, comprising: a memory, configured to store program instructions for performing a method of resilient human-on-the-loop range-only cooperative positioning of a plurality of unmanned aerial vehicles (UAVs); and a processor, coupled with the memory and, when executing the program instructions, configured for: determining an initial distribution, an initial policy, and a pre-defined tolerance value by a processer; computing an initial exploitability using the initial distribution and the initial policy; for an i-th iteration, for each time step, performing a forward updating of a distribution of a portion of the plurality of UAVs by computing a distribution of the portion of the plurality of UAVs at a k-th time step using a distribution of the portion of the plurality of UAVs at a (k−1)-th time step under a policy at the (k−1)-th time step; and performing a backward updating of a Q function of each UAV of the plurality of UAVs by computing a Q function value of each UAV of the plurality of UAVs at the k-th time step using a Q function value of each UAV of the plurality of UAVs at a (k+1)-th time step; for each time step, calculating a dual variable at an (i+1)-th iteration using a dual variable at the i-th iteration and the computed Q function value of each UAV of the plurality of UAVs at the i-th iteration; and calculating a policy at the (i+1)-th iteration using the calculated dual variable at the (i+1)-th iteration; computing an exploitability at the (i+1)-th iteration and a ratio of the exploitability at the (i+1)-th iteration over the initial exploitability; and when the ratio is less than or equal to the pre-defined tolerance value, controlling the UAV movement by maintaining a policy at the i-th iteration for each UAV of the plurality of UAVs by maintaining a policy at the i-th iteration; and when the ratio is greater than the pre-defined tolerance value, using the policy at the (i+1)-th iteration for controlling the movement of each UAV of the plurality of UAVs using the policy at the (i+1)-th iteration. 7 . The system according to claim 6 , wherein the dual variable at the (i+1)-th iteration is calculated by: y k

Assignees

Inventors

Classifications

  • Probabilistic graphical models, e.g. probabilistic networks · CPC title

  • Aircraft, e.g. drones · CPC title

  • Battlefields · CPC title

  • by centralised control off-board any of the vehicles · CPC title

  • Minimising the exposure of a vehicle to threats, e.g. avoiding interceptors · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12504772B2 cover?
The present disclosure provides a method, a system and a storage medium of resilient human-on-the-loop range-only cooperative positioning of a plurality of unmanned aerial vehicles (UAVs). The method includes computing an initial exploitability using an initial distribution and an initial policy; performing a forward updating of a distribution of a portion of the plurality of UAVs, and performi…
Who is the assignee on this patent?
Intelligent Fusion Tech Inc
What technology area does this patent fall under?
Primary CPC classification G05D1/69. Mapped technology areas include Physics.
When was this patent published?
Publication date Tue Dec 23 2025 00:00:00 GMT+0000 (Coordinated Universal Time) (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 12 related publications on this page (citations in our corpus or others sharing the same primary CPC).