Signalling of video content including sub-picture bitstreams for video coding
US-2018277164-A1 · Sep 27, 2018 · US
US12075029B2 · US · B2
| Field | Value |
|---|---|
| Publication number | US-12075029-B2 |
| Application number | US-202217959088-A |
| Country | US |
| Kind code | B2 |
| Filing date | Oct 3, 2022 |
| Priority date | Apr 3, 2018 |
| Publication date | Aug 27, 2024 |
| Grant date | Aug 27, 2024 |
A practical reading order for non-experts. Skip the full description unless you need deep technical detail.
What the patent document calls the invention.
A short plain-language summary of the technical disclosure.
Who owns or filed the patent and who is credited as inventor.
Filing, priority, publication, and grant dates set the timeline.
The legal scope of protection — read this for what is actually claimed.
Technology tags used to group this patent with similar filings.
Prior art links and similar publications in this corpus.
Official abstract text for this publication.
A video coding mechanism for viewpoint dependent video coding is disclosed. The mechanism includes mapping a spherical video sequence into a plurality of sub-picture video sequences. The mechanism further includes encoding the plurality of sub-picture video sequences as sub-picture bitstreams to support merging of the plurality of sub-picture bitstreams, the encoding ensuring that each sub-picture bitstream is self-referenced and two or more of the sub-picture bitstreams can be merged to generate a single video bitstream using a lightweight bitstream rewriting process that does not involve changing of any block-level coding results. A mergeable indication is encoded to indicate that the sub-picture bitstream containing the indication is compatible with a multi-bitstream merge function for reconstruction of the spherical video sequence. A set of the sub-picture bitstreams and the mergeable indication are transmitted toward the decoder to support decoding and displaying a virtual reality video viewport.
Opening claim text (preview).
What is claimed is: 1. A method implemented by a decoder, the method comprising: receiving a bitstream comprising a temporal motion-constrained tile set (MCTS) supplemental enhancement information (SEI) message for a picture and an active picture parameter set (PPS) containing a tiles_enabled_flag set to zero to indicate the picture contains only one tile, wherein a value of an each_tile_one_tile_set_flag is set to one, wherein an entropy_coding_sync_enabled_flag is set to zero for the PPS when wavefront parallel processing (WPP) is not in use, wherein the picture is partitioned into sub-pictures, and wherein a same picture order count (POC) value is set for all sub-pictures partitioned from a same picture; and decoding the bitstream based on the MCTS SEI message, the PPS, the tiles_enabled_flag, the each_tile_one_tile_set_flag, and the entropy_coding_sync_enabled_flag. 2. The method of claim 1 , wherein the temporal MCTS SEI message is used despite inclusion of a single tile in the picture of a spherical video sequence. 3. The method of claim 1 , wherein the bitstream is a sub-picture bitstream carried in a track. 4. The method of claim 1 , wherein the picture is coded by motion vectors that reference fractional sample locations that employ only full sample locations inside a same sub-picture video signal for interpolation, and not motion vectors that reference fractional sample locations that employ partial sample locations inside the same sub-picture video signal for interpolation. 5. The method of claim 1 , wherein the picture is further coded by motion vectors that reference full sample locations in a same sub-picture video signal for interpolation, and not motion vectors that reference full sample locations outside the same sub-picture video signal for interpolation. 6. The method of claim 1 , wherein the bitstream includes a sub-picture of the picture, and wherein decoding the bitstream includes decoding the sub-picture for display. 7. A decoder comprising: a receiver configured to receive a bitstream comprising a temporal motion-constrained tile set (MCTS) supplemental enhancement information (SEI) message for a picture and an active picture parameter set (PPS) containing a tiles_enabled_flag set to zero to indicate the picture contains only one tile, wherein a value of an each_tile_one_tile_set_flag is set to one, wherein an entropy_coding_sync_enabled_flag is set to zero for the PPS when wavefront parallel processing (WPP) is not in use, wherein the picture is partitioned into sub-pictures, and wherein a same picture order count (POC) value is set for all sub-pictures partitioned from a same picture; and a processor coupled to the receiver and configured to decode the bitstream based on the MCTS SEI message, the PPS, the tiles_enabled_flag, the each_tile_one_tile_set flag, and the entropy_coding_sync_enabled_flag. 8. The decoder of claim 7 , wherein the temporal MCTS SEI message is used despite inclusion of a single tile in the picture of a spherical video sequence. 9. The decoder of claim 7 , wherein the bitstream is a sub-picture bitstream carried in a track. 10. The decoder of claim 7 , wherein the picture is coded by motion vectors that reference fractional sample locations that employ only full sample locations inside a same sub-picture video signal for interpolation, and not motion vectors that reference fractional sample locations that employ partial sample locations inside the same sub-picture video signal for interpolation. 11. The decoder of claim 7 , wherein the picture is further coded by motion vectors that reference full sample locations in a same sub-picture video signal for interpolation, and not motion vectors that reference full sample locations outside the same sub-picture video signal for interpolation. 12. The decoder of claim 7 , wherein the bitstream includes a sub-picture of the picture, and wherein decoding the bitstream includes decoding the sub-picture for display. 13. A non-transitory computer readable medium comprising a computer program product for use by a video coding device, the computer program product comprising computer executable instructions stored on the non-transitory computer readable medium such that when executed by a processor cause the video coding device to: receive a bitstream comprising a temporal motion-constrained tile set (MCTS) supplemental enhancement information (SEI) message for a picture and an active picture parameter set (PPS) containing a tiles_enabled_flag set to zero to indicate the picture contains only one tile, wherein a value of an each_tile_one_tile_set_flag is set to one, wherein an entropy_coding_sync_enabled_flag is set to zero for the PPS when wavefront parallel processing (WPP) is not in use, wherein the picture is partitioned into sub-pictures, and wherein a same picture order count (POC) value is set for all sub-pictures partitioned from a same picture; and decode the bitstream based on the MCTS SEI message, the PPS, the tiles_enabled_flag, the each tile_one_tile_set_flag, and the entropy_coding_sync_enabled_flag. 14. The non-transitory computer readable medium of claim 13 , wherein the temporal MCTS SEI message is used despite inclusion of a single tile in the picture of a spherical video sequence. 15. The non-transitory computer readable medium of claim 13 , wherein the bitstream is a sub-picture bitstream carried in a track. 16. The non-transitory computer readable medium of claim 13 , wherein the picture is coded by motion vectors that reference fractional sample locations that employ only full sample locations inside a same sub-picture video signal for interpolation, and not motion vectors that reference fractional sample locations that employ partial sample locations inside the same sub-picture video signal for interpolation. 17. The non-transitory computer readable medium of claim 13 , wherein the picture is further coded by motion vectors that reference full sample locations in a same sub-picture video signal for interpolation, and not motion vectors that reference full sample locations outside the same sub-picture video signal for interpolation. 18. The non-transitory computer readable medium of claim 13 , wherein the bitstream includes a sub-picture of the picture, and wherein decoding the bitstream includes decoding the sub-picture for display.
Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction · CPC title
Decoders specially adapted therefor, e.g. video decoders which are asymmetric with respect to the encoder · CPC title
using parallelised computational arrangements · CPC title
the region being a picture, frame or field · CPC title
Position within a video image, e.g. region of interest [ROI] · CPC title
Related publications grouped by family.
Answers are generated from the same data shown on this page.