MAR Reference Model - Web3D Consortium...Aural (Intensity, Pitch, …) Haptic (Force, Direction,...
Transcript of MAR Reference Model - Web3D Consortium...Aural (Intensity, Pitch, …) Haptic (Force, Direction,...
Page 1
MAR Reference Model
Results of the 4th MAR JAhG Meeting hold in
Seoul, January 2014
Author, Co-Chair: Marius Preda, TELECOM SudParis, SC29
Presenter: Don Brutzman, Web3D Consortium and NPS
Definition and architecture
Page 2
Direct perception
Computer mediated perception
AR
Definition
MAR focus
Page 3
MAR Engine
Sensors /
Actuators
MAR Scene
Descriptions
Additional
Media
Services
AVH Display
/ UI
AR
Global Architecture
MAR Engine
Sensors /
Actuators
MAR Scene
Descriptions
Additional
Media
Services
AVH Display
/ UI
AR
Enterprise Viewpoint
AR Authoring Tools Creator (ARATC)
AR Experience Creator (AREC)
Content Creator (CC)
CA
Content Aggregator (CA)
DMCP
Telecommunication Operator (TO)
TO
Service Middleware/Component
Provider (SMCP)
AR Service Provider (ARSP)
Device Manufacturer (DM)
Device Middleware/Component Provider (DMCP)
DM
End-User Profile (EU)
EUP
DMCP
CC
CC
TO
TO
TO
ARSP
SMCP
AREC ARATC
Page 4
MAR Engine
MAR Scene
Descriptions
Additional Media
Services
AVH Display
/ UI
AR
Computational Viewpoint
Actuator
Device
Real World
AVH Capture
Spatial
MapperScene Graph
Engine
AVH
RendererRec/Tracker
Context
Mapper
Direct perception
Computer mediated
perception
AR
Page 5
Component Dimension Types
Real World AVH Capture
Dimension (compute the
context)
1. Modality Visual Auditory Electro-magnetic
waves (e.g. GPS)
Temperature Other physical
properties
2. Source type Live Pre-captured
Real World AVH Capture
Dimension (contribute to
composition)
1. Modality Visual Auditory Haptics properties Other
2. Form of Visual Modality Stil image 2D Video 3D Video (video +
depth)
3D mesh Other
3. Source type Live Pre-captured
Recognizer 1. Form of Target Signal Image patch 3D primitives 3D Model Earth- reference
coordinates
None
2. Form of the Output Event Recognized or not Additional data: Type, Timestamp,
Recognition confidence level, other
attributes
3. Execution place Local Remote
Tracker 1. Form of Target Signal Image patch 3D primitives 3D Model Earth- reference
coordinates
None
2. Form of the Output Event Spatial (2D, 3D, 6D, …) Aural (Intensity, Pitch, …) Haptic (Force,
Direction, …)
3. Execution place Local Remote
Actuator 1. Modality Motion Temperature Lighting Object shapes Other
2. Execution place Local Remote
Scene Graph Engine 1. Space & time 2D + t 3D + t
2. User Interactivity Yes No
3. Execution place Local Remote Hybrid
4. Number of simultanous users Single-user Multi-user
AVH Renderer 1. Modality Visual Aural Haptics Other
2. Execution place Local Remote Hybrid
Visual Display 1. Presentation optical see through video see through projection
2. Mobility Fixed Mobile Controlled
3. No of channels 2D (mono) 3D stereoscopic 3D holographic
Aural Display 1. No of channels Mono Spatial
2. Acoustic space coverage Headphones Speaker
Haptics Display 1. Type Vibration Pressure Temperature Other
MAR Reference Model
Component based classification system
Terminology
Page 6
Definition of a set of terms
Terminology
Usage cases
Page 7
MAR Reference Model
Modeling of 6 state of the art AR use cases:
1. Real-time, local detection, no registration2. Real-time, local detection, local registration
3. Real-time, remote detection, no registration4. Real-time, remote detection, remote registration
5. Real-time, remote detection, local registration
6. Real-time, remote detection, registration and augmentation, local presentation
Local vs Remote
MAR Reference Model
Points of Interests
Modeling of 2 AR use cases using Point of Interests:
1. Content embedded POIs2. Server available POIs
Page 8
MAR Reference Model
2D vs 3D video
Modeling of 4 AR use cases using 3D video:
1. Real-time, local depth estimation, condition based augmentation2. Real-time, local depth estimation, model based augmentation
3. Real-time, remote depth estimation, condition based augmentation4. Real-time, remote depth estimation, model based augmentation
MAR Reference Model
Stereo vs 3D audio
Modeling of 2 AR use cases using 3D audio:
1. Real-time, spatial audio based in intensity2. Real-time, 3D audio based HRTF (Head-related Transfer Function)
Page 9
MAR Reference Model
Use cases
Local vs RemotePoint of Interests3D video3D audio
MAR Reference Model
Use cases
Local vs RemotePoint of Interests3D video3D audio
Page 10
Content
Designer
Target Images/
Set of descriptorsScene
MAR Browser
Detection
Library
MAR device
Real world
capture
Camera frames
Target Images/Descriptors
ID Mask
Camera
MAR Reference Model
1. Real-time, local detection, no registration
Content
Designer
Target Images/
Set of descriptorsScene
AR Browser
Detection &
Library
Detection &
Tracking
Library
Mobile device
Camera frames
Target Images/Descriptors
ID Mask
+ TM
Augmentation media
TM = Transformation Matrix
MAR Reference Model
2. Real-time, local detection, local registration
Real world
captureCamera
Page 11
Content
Designer
Target Images/
Set of descriptorsScene
AR Browser
Detection
Library
Mobile device
Video stream/timed images
Timestamp + ID Mask
Processing
Server
Processing Server URLTarget Images/Descriptors + IDs
MAR Reference Model
3. Real-time, remote detection, no registration
Real world
capture Camera
Content
Designer
Target Images/
Set of descriptorsScene
AR Browser
Detection &
Library
Detection &
Tracking
Library
Mobile device
Video stream/timed images
Timestamp + ID Mask
+ TM
Processing
Server
Augmentation media
Processing Server URLTarget Images/Descriptors + IDs
MAR Reference Model
4. Real-time, remote detection, remote registration
Real world
captureCamera
TM = Transformation Matrix
Page 12
Content
DesignerScene
AR Browser
Detection
Library
Mobile device
Video stream/timed images
Initial Region +
Augmentation Media
Processing
Server
Rectangle = Target Image to be detected locally
Processing Server URL
Large
Image DB
Augmentation
Media
MAR Reference Model
5. Real-time, remote detection, local registration
Tracking
Library
Real world
captureCamera
Content
DesignerScene
AR Browser
Detection
& Tracking
Library
Mobile device
Video stream/timed images
Processing
Server
Processing Server URL
Target Images/
Set of descriptors
Composed
Stream
Composed Stream = the Processing Server composes the Video and the Augmentation
Media and sends back the Augmented Stream
Augmentation Media
+ Video
MAR Reference Model
6. Real-time, remote registration and detection, local presentation
Real world
capture Camera
Page 13
MAR Reference Model
Use cases
Local vs RemotePoint of Interests3D video3D audio
MAR Reference Model
1. Content embedded POIs
Content
Designer
POIsScene
AR Browser
Scene/World
Registration
Scene/World
coordinates
Registration
Mobile device
TM
Augmentation media
TM = Transformation Matrix
Real world
captureCamera
Map service
Sensors
Page 14
MAR Reference Model
2. Server available POIs
Content
Designer
POIs server addressScene
AR Browser
Scene/World
Registration
Scene/World
coordinates
Registration
Mobile device
TM
Augmentation parameters
TM = Transformation Matrix
Real world
captureCamera
Map service
Sensors
POI + content
server
MAR Reference Model
Use cases
Local vs RemotePoint of Interests3D video3D audio
Page 15
MAR Reference Model
1. Real-time, local depth estimation, condition based
augmentation
Content
DesignerAR Scene
Camera
right
Camera
left
Stereoscopic images(+ camera parameters)
Depth
estimation
Image + depth
Condition
Scale
Condition
detector
AR Browser
Mobile device
Real world
capture
MAR Reference Model
2. Real-time, local depth estimation, model based
augmentation
Content
DesignerStereoscopic images(+ camera parameters)
Depth estimation +
Detection Library
3D approx.
of real world
AR Browser
Mobile device
Camera
right
Camera
left
AR Scene
Real world
capture
Transformation matrix
of the camera in real world
Page 16
MAR Reference Model
3. Real-time, remote depth estimation, condition based
augmentation
Content
Designer
Stereoscopic images(+ camera parameters)
Depth Depth
estimation
+
Detection
Library
Depth (+transformation matrix)
AR Browser
Mobile device
Camera
right
Camera
left
AR Scene
Condition
Orientation
+ Scale
Processing
Server
Condition
detector
Processing server URL
Real world
capture
MAR Reference Model
4. Real-time, remote depth estimation, model based
augmentation
Content
Designer
Stereoscopic images(+ camera parameters)
Depth Depth
estimation
+
Detection
Library
AR Browser
Camera
right
Camera
left
AR Scene
Real world
capture
Processing server URL
Processing
Server
3D approx.
of real world
Transformation matrix
of the camera in real world
Mobile device
Page 17
Get involved in MAR Reference Model
1. Stakeholders and participants• MAR Reference Model is intended to become an ISO standard
• Animated by SC24/WG9 and SC29/WG11
• Contributions from Web3D, ARS, OGC
• Open to all interested in developing an open and free standard
2. ISO Intellectual property rights policy• MAR Reference model will be published by ISO under the royalty free
policy
3. How to get involved• Participate to meetings of any standard organization involved (ISO,
Web3D, OGC)
• Direct contributions on http://wg11.sc29.org/trac/augmentedreality
4. Contact information• Marius Preda ([email protected])
• Gerry Kim ([email protected])
X3D is a royalty-free open-standard file format
• Communicate animated 3D scenes using XML
• Run-time architecture for consistent user interaction
• ISO-ratified standard for storage, retrieval and playback of
real-time graphics content
• Enables real-time communication of 3D data across
applications: archival publishing format for Web
• Rich set of componentized features for engineering and
scientific visualization, CAD and architecture, medical
visualization, training and simulation, multimedia,
entertainment, education, and more
What is Extensible 3D (X3D)?
Page 18
• X3D version 4.0 will support the MAR Reference model
and HTML5/DOM/X3DOM
– Much work is complete already
• X3D AR working group cochairs:
– Gun Lee, University of New Zealand
– Timo Engelke, Fraunhofer
• Speaker contact, X3D working group cochair:
– Don Brutzman, Naval Postgraduate School
– [email protected] cell +1.831.402.4809
X3D AR