Methods and systems for scheduling mmWave communications using reinforcement learning

US12439431B2 · US · B2

Patent metadata
Field: Value
Publication number: US-12439431-B2
Application number: US-202117354045-A
Country: US
Kind code: B2
Filing date: Jun 22, 2021
Priority date: Jun 22, 2021
Publication date: Oct 7, 2025
Grant date: Oct 7, 2025

How to read this patent

A practical reading order for non-experts. Skip the full description unless you need deep technical detail.

  1. Title

    What the patent document calls the invention.

  2. Abstract

    A short plain-language summary of the technical disclosure.

  3. Assignees and inventors

    Who owns or filed the patent and who is credited as inventor.

  4. Key dates

    Filing, priority, publication, and grant dates set the timeline.

  5. First independent claim

    The legal scope of protection — read this for what is actually claimed.

  6. CPC / IPC classifications

    Technology tags used to group this patent with similar filings.

  7. Citations and related patents

    Prior art links and similar publications in this corpus.

Abstract

Official abstract text for this publication.

A controller for scheduling mmWave communication is provided. The controller is programmed to identify a plurality of states, each of the plurality of states indicating status of mmWave communication links among a plurality of nodes, calculate an updated value for each of the plurality of states iteratively based on a value iteration algorithm and a previous value for each of the plurality of states until the updated value for each of the plurality of states converges, and select one of the plurality of states based on the converged values for the plurality of states.
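
The iterative update described in the abstract can be illustrated with a generic value iteration loop. This is a minimal sketch and not the patented implementation: the model format, the discount factor `gamma`, the tolerance `tol`, the toy state names, and the rule of picking the state with the highest converged value are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical model format (not from the patent): model[state] is a list of
# actions, and each action is a list of (probability, next_state, reward).
Outcome = Tuple[float, str, float]
Model = Dict[str, List[List[Outcome]]]


def value_iteration(model: Model, gamma: float = 0.9, tol: float = 1e-6,
                    max_iters: int = 10_000) -> Dict[str, float]:
    """Repeat the Bellman update until the largest per-state change is below tol.

    Assumes every state has at least one action.
    """
    values = {s: 0.0 for s in model}  # previous values, initialised to zero
    for _ in range(max_iters):
        # Updated value of each state: best expected reward plus discounted
        # previous value of the successor state.
        updated = {
            s: max(
                sum(p * (r + gamma * values[nxt]) for p, nxt, r in outcomes)
                for outcomes in actions
            )
            for s, actions in model.items()
        }
        delta = max(abs(updated[s] - values[s]) for s in model)
        values = updated
        if delta < tol:  # converged
            break
    return values


# Toy example: "idle" has no active link, "ab_active" has one active link
# that earns a reward of 1.0 for every step it stays up.
model: Model = {
    "idle": [[(1.0, "idle", 0.0)], [(1.0, "ab_active", 0.0)]],
    "ab_active": [[(1.0, "ab_active", 1.0)]],
}
values = value_iteration(model)
best_state = max(values, key=values.get)  # assumed selection rule
```

With these toy numbers the values settle near 10 for `ab_active` and 9 for `idle`, so the sketch selects `ab_active`.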

First claim

Opening claim text (preview).

What is claimed is:

1. A controller which is configured to: receive an intent to communicate from one or more of a plurality of nodes; identify a plurality of states based at least in part on the one or more intents to communicate, each of the plurality of states indicating status of mmWave communication links among the plurality of nodes; calculate an updated value for each of the plurality of states iteratively based on a value iteration algorithm and a previous value for each of the plurality of states until the updated value for each of the plurality of states converges; and select one of the plurality of states based on the converged values for the plurality of states, wherein: the updated value for each of the plurality of states is calculated at least based on a reward value and a transition probability from a first state to a second state; and the reward value is calculated based on a weight value related to a link to be added or removed and a number of conflicts due to an addition or a removal of the link.

2. The controller of claim 1, wherein the controller is configured to: broadcast information about the selected state over a V2X channel.

3. The controller of claim 1, wherein the updated value for each of the plurality of states is calculated using Bellman equation.

4. The controller of claim 1, wherein: each of the mmWave communication links is associated with a weight parameter; and the updated value for each of the plurality of states is calculated further based on the weight parameter.

5. The controller of claim 1, wherein the controller is configured to: broadcast information about the selected state over a 5.9 GHz V2X channel.

6. The controller of claim 1, wherein the plurality of nodes include a plurality of connected vehicles.

7. The controller of claim 1, wherein the plurality of nodes include an edge server.

8. The controller of claim 1, wherein one or more of the plurality of states include mmWave communication links that conflict each other.

9. A method comprising: receiving an intent to communicate from one or more of a plurality of nodes; identifying a plurality of states based at least in part on the one or more intents to communicate, each of the plurality of states indicating status of mmWave communication links among the plurality of nodes; calculating an updated value for each of the plurality of states iteratively based on a value iteration algorithm and a previous value for each of the plurality of states until the updated value for each of the plurality of states converges; and selecting one of the plurality of states based on the converged values for the plurality of states, wherein: the updated value for each of the plurality of states is calculated at least based on a reward value and a transition probability from a first state to a second state; and the reward value is calculated based on a weight value related to a link to be added or removed and a number of conflicts due to an addition or a removal of the link.

10. The method of claim 9, further comprising: broadcasting information about the selected state over a V2X channel.

11. The method of claim 9, wherein the updated value for each of the plurality of states is calculated using Bellman equation.

12. The method of claim 9, wherein: each of the mmWave communication links is associated with a weight parameter; and the updated value for each of the plurality of states is calculated further based on the weight parameter.

13. A vehicle system comprising: a controller which is configured to: receive an intent to communicate from one or more of a plurality of nodes; identify a plurality of states based at least in part on the one or more intents to communicate, each of the plurality of states indicating status of mmWave communication links among the plurality of nodes; calculate an updated value for each of the plurality of states iteratively based on a value iteration algorithm and a previous value for each of the plurality of states until the updated value for each of the plurality of states converges; select one of the plurality of states based on the converged values for the plurality of states; and broadcast the selected state over a V2X channel, wherein: the updated value for each of the plurality of states is calculated at least based on a reward value and a transition probability from a first state to a second state; and the reward value is calculated based on a weight value related to a link to be added or removed and a number of conflicts due to an addition or a removal of the link.
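
The independent claims tie the reward to a link weight and to the number of conflicts caused by adding or removing that link, but the claim text shown here does not give a formula. The sketch below is one possible reading, offered only as an illustration: the signed-weight-minus-penalty form, the `conflict_penalty` parameter, the rule that two links conflict when they share a node, and the vehicle names are all hypothetical.

```python
from typing import Dict, FrozenSet, Iterable, Tuple

Link = Tuple[str, str]   # (transmitter, receiver); node names are made up
State = FrozenSet[Link]  # a state is the set of currently active links


def count_conflicts(links: Iterable[Link]) -> int:
    """Count link pairs that share a node (assumed conflict rule)."""
    links = list(links)
    return sum(
        1
        for i in range(len(links))
        for j in range(i + 1, len(links))
        if set(links[i]) & set(links[j])
    )


def toggle_reward(state: State, link: Link, weights: Dict[Link, float],
                  conflict_penalty: float = 1.0) -> Tuple[float, State]:
    """Reward for adding `link` if absent, or removing it if present.

    Assumed form: signed link weight minus a penalty per newly created
    conflict (removing a conflicting link therefore earns the penalty back).
    """
    if link in state:
        next_state = state - {link}
        signed_weight = -weights[link]
    else:
        next_state = state | {link}
        signed_weight = weights[link]
    conflict_change = count_conflicts(next_state) - count_conflicts(state)
    return signed_weight - conflict_penalty * conflict_change, next_state


weights = {("v1", "v2"): 1.0, ("v2", "v3"): 0.5, ("v3", "v4"): 0.8}
state: State = frozenset({("v1", "v2")})

r_free, s_free = toggle_reward(state, ("v3", "v4"), weights)    # no shared node
r_clash, s_clash = toggle_reward(state, ("v2", "v3"), weights)  # shares v2
r_drop, s_drop = toggle_reward(state, ("v1", "v2"), weights)    # removal
```

Under these assumptions, adding the non-conflicting link yields a positive reward, while adding the link that shares node `v2` is penalised enough to make the reward negative. A reward of this shape could feed the reward term of a value iteration update.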

Assignees

Inventors

Classifications

  • for vehicles, e.g. vehicle-to-pedestrians [V2P] · CPC title

  • the criterion being a learning criterion · CPC title

  • Direct-mode setup · CPC title

  • Transitions between radio resource control [RRC] states · CPC title

  • for vehicle-to-vehicle communication [V2V] · CPC title

Patent family

Related publications grouped by family.

External sources

Frequently asked questions

Answers are generated from the same data shown on this page.

What does patent US12439431B2 cover?
A controller for scheduling mmWave communication is provided. The controller is programmed to identify a plurality of states, each of the plurality of states indicating status of mmWave communication links among a plurality of nodes, calculate an updated value for each of the plurality of states iteratively based on a value iteration algorithm and a previous value for each of the plurality of s…
Who is the assignee on this patent?
Toyota Eng & Mfg North America
What technology area does this patent fall under?
Primary CPC classification H04W72/30. Mapped technology areas include Electricity.
When was this patent published?
Publication date: Oct 7, 2025 (B2). Legal status and post-grant events are not shown on this page.
What related patents are in patentsdb?
We list 6 related publications on this page (citations in our corpus or others sharing the same primary CPC).