Why is it so difficult to connect users to...
Transcript of Why is it so difficult to connect users to...
![Page 1: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/1.jpg)
Luchelan
How using faster, cheaper and better ways of managing data can help geoscientists
achieve competitive advantage
Alan H Smith
Luchelan Limited
Connecting Subsurface, Drilling expertise with Digital TechnologyDigital Energy / Finding Petroleum, Kuala Lumpur, 4 October 2016
![Page 2: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/2.jpg)
Luchelan
Acknowledgements
![Page 3: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/3.jpg)
Luchelan
Agenda
• Introduction
• Changing people, process & technology
• Example - MultiClient seismic data
• Where next?
![Page 4: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/4.jpg)
Luchelan
Agenda
• Introduction
• Changing people, process & technology
• Example - MultiClient seismic data
• Where next?
![Page 5: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/5.jpg)
LuchelanStatus in 1991?
After Tonstad, 2002
![Page 6: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/6.jpg)
Luchelan
Status 2015
TimespentonETL8%
Timespentdatacleaning12%
BasicExplDataAnal16%
Machinelearning/Stats12%Crea ngvisuals
11%
Presen ng9%
Timespentinmee ngs15%
Holidays7%
Coffeeetc6%
Training4%
Based on “Time spent on Data Science” (O’Reilly, 2016)
Examples from Analytics in E&P (Courtesy Teradata)
Well data example• 50% of time spent
preparing dataSeismic / Navigation data example.• 80% of time spent finding
& preparing data
![Page 7: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/7.jpg)
Luchelan
Agenda
• Introduction
• Changing people, process & technology
• Example - MultiClient seismic data
• Where next?
![Page 8: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/8.jpg)
Luchelan
Technology in the 90s
8
![Page 9: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/9.jpg)
Luchelan
What we were doing
• Did we have the technology capable of managing data types?
• Projects to get data into suitable systems
• The start of National Data Repositories– CDA, Diskos
• Efficiency discussed –but did things actually improve?
People
Technology
Process
![Page 10: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/10.jpg)
Luchelan
Technology about 2005
![Page 11: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/11.jpg)
Luchelan
Holditch – 2002 SPE President
“Our members have changing needs and expectations," Holditch said. "Technical information needs to be available ‘on demand’. Easy and efficient access to technical knowledge is key to success for today's E&P professionals.”
SPE 78337
![Page 12: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/12.jpg)
Luchelan Evaluation of the DISKOS project
The tangible benefits exeeds the cost!
Cost reduction in 1999
94 95 96 97 98 99 00
Cost ac 2,1 4,8 9,1 13,9 19,2 26,2 33,5
Benerfit ac 0 0 0 5 15 42,4 69,8
After Tonstad, 2002
![Page 13: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/13.jpg)
Luchelan
What we were doing
• Did we have the the processes in place?
• Understanding that services were needed not just projects
• Struggling with corporate / master / project data management
• Efficiency discussed –but did things actually improve? People Technology
Process
![Page 14: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/14.jpg)
Luchelan
Technology from ~2015
14
![Page 15: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/15.jpg)
Luchelan
What we were doing
• People to the fore• Understanding that getting the
right people with the appropriate skills is critical
• More acceptance of the need for good data management in organisations
• Efficiency still needs to be improved especially with low oil price
• Is technology making a comeback?– Web / Cloud based systems– High bandwidth
communications– Legacy is still an issue
People
TechnologyProcess
![Page 16: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/16.jpg)
Luchelan
The legacy problem
2006 –2010 saw another 12 (upgrades)
2011 to mid 2016another 8 (upgrades)
Total of 230 or more media types
![Page 17: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/17.jpg)
Luchelan
Agenda
• Introduction
• Changing people, process & technology
• Example - MultiClient seismic data
• Where next?
![Page 18: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/18.jpg)
Luchelan
The statistics – PGS MultiClient
• Important Balance Sheet item
• Significant proportion of vessel time
• Significant revenues
• Huge data volumes acquired
• Long shelf life
• Shelf life “reset” with reprocessing etc
• Pre- and Post-Stack and Ancillary products all equally important
• Increasing demand for prestack products
Data from PGS Annual reports
41%45%48%
40%
34%
60%
![Page 19: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/19.jpg)
Luchelan
PGS MultiClient Data delivery
• Heritage system developed in 1990s– Outsourced service provision
– Slow & restricted functionality by current standards
– Only really handled post stack
• New system – Still outsourced
– Trace handling, not processing (sensu stricto)
– Handles pre and post stack data efficiently
– Modern database integrated with IT infrastructure and other enterprise software systems
![Page 20: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/20.jpg)
Luchelan
Multi client seismic management
Field & Interim products
Final productsOld
New
![Page 21: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/21.jpg)
Luchelan
Loading & QC
![Page 22: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/22.jpg)
Luchelan
Delivery & QC
![Page 23: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/23.jpg)
Luchelan
Multiclient complications
Survey extent
Company A
Company B
Company C
Company DWhat are they getting?• Prestack (options)• Stack• Migration• Velocities• …
All need cutting to correct coordinates
NowAutomaticParallel processing
HistoricManual handling Manual intervention
![Page 24: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/24.jpg)
Luchelan
The impact
![Page 25: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/25.jpg)
Luchelan
Ensuring availability
PGS Houston
PGS London
10Gb
Ovation Houston
Ovation London
10Gb 10Gb
10Gb
Client
Client
![Page 26: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/26.jpg)
Luchelan
Agenda
• Introduction
• Changing people, process & technology
• Example - MultiClient seismic data
• Where next?
![Page 27: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/27.jpg)
Luchelan
Data storage
• More data per unit area• Faster• Cheaper
Data transferCable everywhere (nearly) –massive capacitySatellite – filling some holes at predicted speeds approaching 1Gb/s
![Page 28: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/28.jpg)
Luchelan
Leave the data where it is
Format “A”
Format “B”Format “C”
![Page 29: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/29.jpg)
Luchelan
Conclusions
• Large volumes of data can be live on the internet
– QC is essential
– Automate what you can
• Next steps
– Take the application to the data
![Page 30: Why is it so difficult to connect users to data?9bc7c402577152ea1941-3af9e85018fdc836628ce2df369c2d63.r91... · 2016-10-06 · Status 2015 Time spent on ETL 8% Time spent data cleaning](https://reader035.fdocuments.us/reader035/viewer/2022070917/5fb73095ebcf5432554158fe/html5/thumbnails/30.jpg)
Luchelan
Acknowledgements