MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing...
-
Upload
christina-stephens -
Category
Documents
-
view
216 -
download
0
Transcript of MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing...
![Page 1: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/1.jpg)
MPEG-4
• Objective• Standardize algorithms for audiovisual coding in
multimedia applications allowing for• Interactivity
• High compression
• Scalability of audio and video content
• Support for natural and synthetic audio and video
• The Idea• An audiovisual scene is a coded representation of
audiovisual objects related in space and time
![Page 2: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/2.jpg)
MPEG-4: Scenario
• A/V object• A video object within a scene
• The background
• An instrument or voice
• Coded independently
• A/V scene• Mixture of natural or synthetic objects
• Individual bitstreams multiplexed and transmitted
• One or more channels
• Each channel may have its own quality of service
![Page 3: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/3.jpg)
MPEG-4: Video Object Plane
• Video frame = sum of segmented regions with arbitrary shape (VOP)
• Shape motion and texture information of VOPs belonging to the same video object is encoded into a video object layer (VOL)
• Encode• VOL identifiers• Composition information
• Overlapping configuration of VOPs
![Page 4: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/4.jpg)
MPEG-4: Coding
• Shape coding• Shape information in alpha planes
• Transparency of shape encoded
• Inter and intra shape coding functions
• After shape coding each VOP in a VO is partitioned into non-overlapping macroblocks
• Motion coding• Shift parameter wrt reference window
• Standard macroblock
• Contour macroblock
![Page 5: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/5.jpg)
MPEG-4: Coding• Texture coding
• Intra-VOPs, residual errors from motion compensation are DCT coded like MPEG-1
• 4 luminance and 2 chrominance blocks in a macroblock
• P-VOPs (prediction error blocks) may not conform to VOP boundary
• Pixels outside the active area are set to a constant value
• Standard compression
• Efficient prediction of DC and AC components from intra and inter coded blocks
• Multiplexing• Shape motion texture coded data
• Motion and DCT coefficients can be jointly (H.263) or individually coded
![Page 6: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/6.jpg)
MPEG-4 Video Object Segmentation-I
• Construct a video object• User selects start frame, outlines polygon designating rough object
boundary• Refine boundary using snake algorithm, if needed
• Compute a k-pixel bounding box around the object• Within bounding box compute
• Edge map: bit plane, after thresholding a convolution kernel• Color map: compute luminance and chrominance, quantize by k-
means clustering, keep quantization table• Motion field: block-based motion vector
• Segment into regions no significant edge, smooth color having smooth motion
• Intersect segments and initial object boundary and determine foreground and background region
• Estimate the motion of regions in the next frame with an affine motion model
![Page 7: MPEG-4 Objective Standardize algorithms for audiovisual coding in multimedia applications allowing for Interactivity High compression Scalability of audio.](https://reader035.fdocuments.us/reader035/viewer/2022072114/56649d005503460f949d2cff/html5/thumbnails/7.jpg)
MPEG-4 Video Object Segmentation-II
• Track object• Locate estimated position of foreground and background regions
from previous frame. Call this the object mask.
• Generate same three feature maps with the quantization table; Requantize if error is large
• Classify regions into foreground/background and new regions
• Intersection ratio r with object mask
• For foreground regions, if r > 80% OR foreground mask, mark as foreground; label foreground - mask as new
• For new regions, if r < 30% mark as new; if r > 80% mark as foreground; else find nearest-motion-similar neighbor. If it is in the foreground, do previous step, else keep region as new
• Iterate until stable