Hand pointing estimation for human computer interaction
US-2015378444-A1 · Dec 31, 2015 · US
US9858475B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-9858475-B2 |
| Application number | US-78055710-A |
| Country | US |
| Kind code | B2 |
| Filing date | May 14, 2010 |
| Priority date | May 14, 2010 |
| Publication date | Jan 2, 2018 |
| Grant date | Jan 2, 2018 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
In a minimally invasive surgical system, a plurality of video images is acquired. Each image includes a hand pose image. Depth data for the hand pose image is also acquired or synthesized. The hand pose image is segmented from the image using the depth data. The segmented image is combined with an acquired surgical site image using the depth data. The combined image is displayed to a person at a surgeon's console of the minimally invasive surgical system. Processing each of the video images in the plurality video images in this way reproduces the hand gesture overlaid on the video of the surgical site in the display.
Opening claim text (preview).
We claim: 1. A method comprising: receiving an acquired video image at a controller, wherein the acquired video image comprises a hand pose image of a hand gesture; segmenting, by the controller, the hand pose image from the acquired video image to obtain a segmented hand pose image, the segmenting comprising a depth threshold process and a flood fill process, the depth threshold process and the flood fill process each using depth data for pixels in the acquired video image, the depth threshold process eliminating pixels having a depth greater than a maximum depth threshold from the acquired video image to obtain a first modified data frame, and the flood fill process processing the first modified data frame to obtain a hand pose mask; combining, in real time by the controller, the segmented hand pose image with an image of a surgical site to obtain a combined image, wherein the combining comprises using an alpha mask; and sending the combined image from the controller to a display device, wherein the display device is included in a console; wherein the console comprises a master manipulator; and wherein the segmenting the hand pose image further comprises the controller using information characterizing the master manipulator in the segmenting, wherein the information is different from depth data. 2. The method of claim 1 , wherein the information includes a static image of the master manipulator. 3. The method of claim 1 , wherein the information includes kinematic data for a position of the master manipulator. 4. A method comprising: receiving an acquired video image at a controller, wherein the acquired video image comprises a hand pose image of a hand gesture; segmenting, by the controller, the hand pose image from the acquired video image to obtain a segmented hand pose image, the segmenting comprising a depth threshold process and a flood fill process, the depth threshold process and the flood fill process each using depth data for pixels in the acquired video image, the depth threshold process eliminating pixels having a depth greater than a maximum depth threshold from the acquired video image to obtain a first modified data frame, and the flood fill process processing the first modified data frame to obtain a hand pose mask; combining, in real time by the controller, the segmented hand pose image with an image of a surgical site to obtain a combined image, wherein the combining comprises using an alpha mask; and sending the combined image from the controller to a display device, wherein the display device is included in a console; wherein the console comprises a master manipulator; and wherein the method further comprises: parking the master manipulator to provide an unobstructed volume in which to make the hand gesture.
Image segmentation details · CPC title
Mixed reality (object pose determination, tracking or camera calibration for mixed reality G06T7/00) · CPC title
Physics · mapped topic
Static hand or arm · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.