PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · •...
Transcript of PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · •...
![Page 1: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/1.jpg)
GTC 2017
![Page 2: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/2.jpg)
The Smartvid.io solution
OUR MISSION
we're unlocking the value of photos and videos to dramatically improve safety, quality and productivity in the AEC industry.
@
![Page 3: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/3.jpg)
An untapped resource
MEDIA FROM THE FIELD
The amount of pictures & videos captured every day in the field keeps getting bigger.
50 GB of data is generated on the typical project.
Much of it ends up unused, siloed across different systems and devices.
![Page 4: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/4.jpg)
How it works
![Page 5: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/5.jpg)
![Page 6: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/6.jpg)
WE’RE USING MACHINE LEARNING TO AUTOMATICALLY IDENTIFY
“WHAT’S IN” CONSTRUCTION PHOTOS AND VIDEOS…
![Page 7: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/7.jpg)
The results
IMPACT
2016 Annual AI for Safety Photo Contest Typical Construction Project
# REVIEWED 15,000 photos
HUMAN EXPERT TIME 80 days
SMARTVID.IO TIME ~8 days
# REVIEWED 1,080 photos
HUMAN EXPERT TIME 4.5 hours
SMARTVID.IO TIME <10 minutes
![Page 8: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/8.jpg)
STRATEGY
Exponential Data Growth
![Page 9: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/9.jpg)
• Basic: Object recognition• Is object present in image, Yes/No?
• Example: Is there scaffolding in this picture? (Yes/No)
• How used: image search within and across projects for key imagery (e.g., find me scaffolding images b/c I’m looking at a bill for scaffolding and want to check it)
• Advanced: Object analytics and logic• Where are the objects? How many of them are there? What is
their volume? (Quantitative)
• Examples: Is each person wearing high vis safety gear? What is the location and volume of visual defects like cracks?
• How used: identifying and quantifying visual data • Safety (Hard hats, safety vests, more) , Quality (Cracks, more)
Our deep learning for…
IMAGE RECOGNITION
![Page 10: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/10.jpg)
EXAMPLE: ADVANCED IMAGE RECOGNITION FINDS PEOPLE (1) THEN DETERMINES IF
THEY ARE SAFE (2), THUS “FOCUSING” THE AI
![Page 11: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/11.jpg)
QUANTITATIVE DATA IS AVAILABLE FROM OUR COMPUTER VISION
LINEAL EXTENT OF CRACK INTEGRITY MEASURE
![Page 12: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/12.jpg)
And deep learning for…
SPEECH RECOGNITION
• Industry keywords automatically detected from speech in video
• Tags are linked to timeline of video for instant retrieval and easy sharing or collaboration
• How used
– Field worker narrates video using Smartvid.ioapp or native IOS or Android device
– Office user (manager) can search by keyword
– Example: see all installation of blocking, by location
![Page 13: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/13.jpg)
How it works
OUR TECHNOLOGY
Multiple AWS P2 instances for model training & runtime execution
Full spectrum deep learning for computer vision & speech
• 5-10+ instances at peak training
COMMODITYFind objects of interest
Locate & segment objects
PROPRIETARY
STATE OF THE ART
Multi-model & focal point approach
Quantify objects
![Page 14: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/14.jpg)
SYSTEMSARCHITECTURE
![Page 15: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/15.jpg)
IMAGE MLSTACK
![Page 16: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/16.jpg)
ML AT SCALE
• Gain access to data • Manage data access (ingestion) • Clean data• Manage data • Build data sets for training and evaluation
![Page 17: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/17.jpg)
MLINFRASTRUCTURE
![Page 18: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/18.jpg)
ALTERNATESYSTEMSARCHITECTURE
![Page 19: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/19.jpg)
CONCLUSION
• AEC industry is creating tremendous amounts of visual and audio data • Deep learning can unlock value for safety, quality, productivity • New techniques must be applied to handle complexity of imagery and
scale of data
Come by the Dell Booth to see Smartvid.io in action. Case studies available on cracks and hard hats at www.smartvid.io.
Josh Kanner, [email protected] True, [email protected]
![Page 20: PowerPoint Presentationon-demand.gputechconf.com/gtc/2017/presentation/s7575-Josh-Kann… · • Basic: Object recognition ... •5-10+ instances at peak training Find objects of](https://reader031.fdocuments.us/reader031/viewer/2022030407/5a876ef47f8b9a001c8dc83e/html5/thumbnails/20.jpg)
Where things are going…