Cognitive state based vehicle manipulation using near-infrared image processing
US-2019197330-A1 · Jun 27, 2019 · US
US12192586B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12192586-B2 |
| Application number | US-202217708876-A |
| Country | US |
| Kind code | B2 |
| Filing date | Mar 30, 2022 |
| Priority date | Sep 30, 2019 |
| Publication date | Jan 7, 2025 |
| Grant date | Jan 7, 2025 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A method for presenting a face in a video call includes: obtaining a key feature point of a facial expression of a user based on a face image of the user; driving a three-dimensional (3D) head image of the user by using the key feature point of the facial expression of the user, to obtain a target 3D avatar of the user, where the target 3D avatar of the user has an expression of the user; rotating the target 3D avatar based on a preset presentation angle, to obtain a target 3D avatar at the preset presentation angle; and transmitting the target 3D avatar at the preset presentation angle to a peer video call device. A user can see a 3D avatar that is of a user and that is at a preset presentation angle in real time.
Opening claim text (preview).
What is claimed is: 1. A method comprising: obtaining, based on a face image of a user in a video call process, a key feature point of a facial expression of the user, wherein the face image comprises N infrared images of the user and a color face image of the user, and wherein N is an integer greater than 0; obtaining, based on the N infrared images and the color face image, a three-dimensional (3D) head point cloud information of the user; constructing, based on the 3D head point cloud information, a 3D head image of the user; driving, using the key feature point, the 3D head image to obtain a first target 3D avatar of the user, wherein the first target 3D avatar has an expression of the user; rotating, based on a preset presentation angle, the first target 3D avatar to obtain a second target 3D avatar at the preset presentation angle; and sending, to a peer video call device, the second target 3D avatar. 2. The method of claim 1 , further comprising: obtaining, based on the color face image, a face texture feature of the user; and further constructing, based on the face texture feature, the 3D head image that is a color image, wherein the 3D head point cloud information comprises first 3D head point cloud information of the user or second 3D head point cloud information of the user. 3. The method of claim 1 , further comprising: inputting the color face image and the N infrared images into a feature extraction model to obtain 3D head point cloud information of the user and a face texture feature of the user; and further constructing, based on the 3D head point cloud information and the face texture feature, the 3D head image that is a color image. 4. The method of claim 3 , wherein the feature extraction model comprises a 3D head feature extraction network and a texture feature extraction network, and wherein the method further comprises: further inputting the color face image and the N infrared images into the 3D head feature extraction network to obtain the 3D head point cloud information; and inputting the color face image into the texture feature extraction network to obtain the face texture feature. 5. An apparatus comprising: a non-transitory memory configured to store instructions; and a processor coupled to the non-transitory memory, wherein when executed by the processor, the instructions cause the apparatus to: obtain, based on a face image of a user in a video call process, a key feature point of a facial expression of the user, wherein the face image comprises N infrared images of the user and a color face image of the user, and wherein N is an integer greater than 0; obtain, based on the N infrared images and the color face image, a three-dimensional (3D) head point cloud information of the user; construct, based on the 3D head point cloud information, a 3D head image of the user; drive, using the key feature point, the 3D head image to obtain a first target 3D avatar of the user, wherein the first target 3D avatar has an expression of the user; rotate, based on a preset presentation angle, the first target 3D avatar to obtain a second target 3D avatar at the preset presentation angle; and send, to a peer video call device, the second target 3D avatar. 6. The apparatus of claim 5 , wherein when executed by the processor, the instructions further cause the apparatus to: obtain, based on the color face image, a face texture feature of the user; and further construct, based on the face texture feature, the 3D head image that is a color image, wherein the 3D head point cloud information comprises first 3D head point cloud information of the user or second 3D head point cloud information of the user. 7. The apparatus of claim 5 , wherein when executed by the processor, the instructions further cause the apparatus to: input the color face image and the N infrared images into a feature extraction model to obtain 3D head point cloud information of the user and a face texture feature of the user; and further construct, based on the 3D head point cloud information and the face texture feature, the 3D head image that is a color image. 8. The apparatus of claim 7 , wherein the feature extraction model comprises a 3D head feature extraction network and a texture feature extraction network, and wherein when executed by the processor, the instructions further cause the apparatus to: input the color face image and the N infrared images into the 3D head feature extraction network to obtain the 3D head point cloud information; and input the color face image into the texture feature extraction network to obtain the face texture feature. 9. The apparatus of claim 8 , wherein the 3D head feature extraction network is a neural network that uses an encoder-decoder architecture, and wherein when executed by the processor, the instructions further cause the apparatus to: obtain, based on the color face image and the N infrared images, N image pairs, wherein each of the N image pairs comprises the color face image and an infrared image of the user from the N infrared images; and input the N image pairs into the neural network to obtain the 3D head point cloud information. 10. The apparatus of claim 9 , wherein when executed by the processor, the instructions further cause the apparatus to obtain, based on the N infrared images, the preset presentation angle. 11. The apparatus of claim 5 , wherein the face image is a color depth image, and wherein when executed by the processor, the instructions further cause the apparatus to: obtain, based on the color depth image, 3D head point cloud information of the user and a face texture feature of the user; and construct, based on the 3D head point cloud information and the face texture feature, the 3D head image that is a color image. 12. The apparatus of claim 11 , wherein when executed by the processor, the instructions further cause the apparatus to obtain, based on the color depth image, the preset presentation angle. 13. A vehicle comprising: a video call system comprising: a processor configured to: obtain, based on a face image of a user in a video call process, a key feature point of a facial expression of the user, wherein the face image comprises N infrared images of the user and a color face image of the user, and wherein N is an integer greater than 0; obtain, based on the N infrared images and the color face image, a three-dimensional (3D) head point cloud information of the user; construct, based on the 3D head point cloud information, a 3D head image of the user; drive, using the key feature point, the 3D head image to obtain a first target 3D avatar of the user, wherein the first target 3D avatar of the user has an expression of the user; rotate, based on a preset presentation angle, the first target 3D avatar to obtain a second target 3D avatar at the preset presentation angle; and transmit the second target 3D avatar; and a communications apparatus coupled to the processor and configured to: receive the second target 3D avatar; and send, to a peer video call device, the second target 3D avatar. 14. The vehicle of claim 13 , wherein the processor is further configured to: obtain, based on the color face image, a face texture feature of the user; and further construct, based on the face texture feature, the 3D head image that is a color image, wherein the 3D head point cloud information comprises first 3D head point cloud information of the user or second 3D head point cloud information of the user. 15. The vehicle of claim 13 , wherein the processor is further configured to: input the color face ima
for generating image signals from visible and infrared light wavelengths · CPC title
Feature extraction; Face representation · CPC title
using acquisition arrangements · CPC title
inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions · CPC title
Recognition of moving objects or obstacles, e.g. vehicles or pedestrians; Recognition of traffic objects, e.g. traffic signs, traffic lights or roads · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.