FutureTDM Workshop II 29 March
-
Upload
futuretdm -
Category
Data & Analytics
-
view
116 -
download
0
Transcript of FutureTDM Workshop II 29 March
OpenDataMonitor
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Project number: 665940
Increasing Uptake of Text and Data Mining in the EU
FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments
using a Collaborative Knowledge and Open Information Approach
Melanie Imming, LIBER, FutureTDM Workshop, European Parliament, Brussels
29 March 2017
What we are talking about today: an overview of the Workshop
A reminder of what we mean by TDM
An overview of the project and its aims
Presentations on our most recent Reports
− Economics of TDM
− Policy Recommendations
Later today:
Session 2: What stakeholders can do to improve TDM in their
sector – developing guidelines
2FutureTDM
“the discovery by computer of new, previously unknown information, by automatically extracting and relating information from different (…)resources, to reveal otherwise hidden meanings” (Hearst, 1999)
What is TDM?
3
ICT Availability
Computing
Power
Data
16 trillion
gigabytes of
data by 2020
(236% growth)
Doubles every
2 years
(Moores Law,
1965)
Over 80% EU
citizens have
internet access
(Eurostat 2014)
Potential of TDM
4
• Addressing grand challenges such as climate change and global
epidemics
• Improving population health, wealth and development
• Creating new jobs and employment
• Exponentially increasing the speed and progress of science through new
insights and greater efficiency of research
• Increasing transparency of governments and their actions
• Fostering innovation and collaboration and boosting the impact of open
science
• Creating tools for education and research
• Providing new and richer cultural insights
• Speeding economic and social development in all parts of the globe
(The Hague Declaration on Knowledge Discovery)
5
TDM is not a homogeneous, self-contained, scientific domain, but rather a diverse and complex set of methods and technologies
deployed in the framework of diverse disciplines and business activities
The challenge
FutureTDM - the opportunity
The FutureTDM project seeks to improve uptake of text and data mining
(TDM) in the EU by actively engaging with stakeholders such as
researchers, developers, publishers and SMEs.
The use of content mining is significantly lower in Europe
than in some American and Asian countries.
The partners in the FutureTDM consortium share the ambition behind the
EC’s call to develop policy and legal frameworks to reduce the barriers of
TDM uptake and with it, promote the awareness of TDM opportunities
across Europe.
6
FutureTDM
7
ELABORATE a legal and policy framework for future TDM, define policy priorities, specify a research agenda to foster the spread of TDM in various research fields within the EU
BUILD a Collaborative Knowledge Base and an Open Information Hub combined on a web-based platform including intuitive tools
ANALYSE current application areas and trends in TDM including
statistics and key figures, collect relevant research and industrial
projects and best practices
ASSESS existing studies, legal regulations and policies on TDM within the European
Union
Main Objectives of FutureTDM
INVOLVE all key stakeholders to identify practices,
requirements, and specific challenges in the field of
TDM
INCREASE awareness of TDM to attract new target groups and science domains
FutureTDM the importance of the community
FutureTDM: Improving uptake of text and data mining
in the EU
9
Importance of the community:
Making the project accessible
to all stakeholders
Collaboration with other
projects like OpenMinTeD
who are focusing on technical
aspects
Reaching the TDM community – what we’ve been doing
We met with people at events, ran our knowledge cafes and held
interviews across Europe
10
Reaching out to the TDM community - media
Perhaps you saw our videos?
We encouraged people to:
Blog on our website www.futuretdm.eu
Tweet us at @futureTDM #FutureTDM
11
What we have produced - Expert Reports and Awareness Sheets
Awareness sheets highlighting TDM projects,
tools and services, organisations and
interviews
FutureTDM Knowledge Library
• Research report on TDM landscape in Europe
• Baseline report of policies, and barriers of TDM in Europe
• European landscape of TDM applications report
• Compendium of best practices and methodologies
• FutureTDM policy framework
Coming soon – practitioner guidelines and roadmap for TDM uptake
12
Key barriers discussed at our first workshop
13
AWARENESS
TALKING POINTS
REWARD & INCENTIVE LEGAL CLARITY
REDUCE SKILLS GAPS ACCESS TO DATA
Knowledge Discovery
Standardisation & Harmonisation
Risk Reduction Licensing?
Privacy Protection & Ethics
Useful data formats
Interoperability - Tools & Resources
No Hidden Barriers
Discoverability
User Friendly
Inclusion Support
Better Data Management Collaboration Across Fields
Promote Innovation
Commercial & Non-Commercial Interaction
Who licenses the internet?
Main users - Industry
Skills lacking
Workshop 1 some feedback
“Anyone with lawful access to data should be allowed to conduct
TDM”. (Advocacy stakeholder)
“The perception seems to be that legal barriers are very significant.
This is true but other issues such as skills and infrastructure are
equally if not more significant and these will take a long time to
address” (Consultant)
“Definitions of TDM may actually be the largest barrier in the
copyright/legal discussions on making progress” (Legal
Practice)
“Licensing remains for us the most certain legal solution,
achieving a good balance between different stakeholders”
(Publisher)
“We need a copyright exception for TDM for commercial and non-
commercial entities” [Science and Innovation Researcher]
14
Now we want to help you
The purpose of the workshop today is to move on from the barriers and talk
solutions. We want positivity, best practice and recommendations to help EU
maximise the TDM opportunity
That’s why we will present:
▪ Economics and impact of TDM
▪ EU Policy recommendations
and later today...
▪ what this might look like in practice for stakeholders
In particular we welcome from you the solutions you see for TDM policy from your
sector’s perspective - how this might look at EU level and in your own environment.
A workshop blog and report will be published on the website after the event
15
What’s next?
Please continue to make your thoughts on this visible to the project so we can continue to
shape best practice:
● tweet today using hashtag #FutureTDM
● write a guest blog on our website
● fill in our feedback cards here today in session 2
● Fill in our quick poll online at www.futuretdm.eu
Save the date 13 June 2017 for for our final Symposium in Salzburg where our
stakeholder guidelines and roadmap for TDM uptake will be made available
16
Our engagement so far...
17
17
Join the Community!
FutureTDM Collaborative Knowledge Base and Open Information Hub
Platform: http://futuretdm.eu
Expert Blog Best Practice Catalogue
Project Database
Tool Selector
Baseline Reports
Comparative Analysis
FutureTDM Guidelines
Leading Projects Area (promoting/knowledge exchange - e.g. OpenMinTeD)
Landscape Overview
OpenDataMonitor
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Project number: 665940
TDM – Economic Perspective
FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments
using a Collaborative Knowledge and Open Information Approach
Jan Strycharz, FutureTDM Workshop 2, European Parliament, Brussels
29th March 2017
TDM & Big Data
19FutureTDM
Value is KEY
20
The Meaning of Value
21
22
23
Translating data…
Under-standing
SelectingPrepro-cessing
Transfor-mation
Choosing the method
ExploringDiscovering
patternsInterpreting
…into action
TDM Process
Types of Data Analyzed #1
24
Types of Data #2
25
How Much - Market
26
Europe 2016 2021
Big Data 9.4 B USD 29.4 B USD
TDM 2.5 B USD 10.3 B USD
World 2016 2019
Big Data 26 B USD 48 B USD
TDM 7 B USD 17 B USD
How Much – Economic Impact
27
Value
28
1:5
1:11
How much – Economic Impact
29
Obstacles
30
Organizational Culture – Data Savvy
31
Organizational Culture – Data Savvy
32
REFERENCES TO ECONOMIC PERSPECTIVES:
Barel, Yannick. Big Data. Big Opportunity, IBM: Power Systems.
Bradshaw, David, Gabriella Cattaneo, Rosanna Lifonti, John Simcox. 2014. Uptake of Cloud in Europe. Follow-up of
IDC Study on Quantitative estimates of the demand for Cloud Computing in Europe and the likely barriers to take-up,
Framingham: IDC Analyze the Future.
Brown, Brad, Jacques Bughin, Angela Hung Byers, Michael Chui, Richard Dobbs, James Manyika, Charles Roxburgh.
2011. Big data: The next frontier for innovation, competition and productivity, McKinsey Global Institute
Buchholtz, Sonia, Maciej Bukowski, Aleksander Śniegocki. 2014. Big and open data in Europe. A growth engine or a
missed opportunity?, Warszawa: demosEUROPA, Warsaw Institute for Economic Studies.
Buytendijk, Frank. 2014. Hype Cycle for Big Data, Stamford: Gartner.
Danson, Forrest et al. (red.). 2016. Analytics Trends 2016. The Next Evolution, New York: Deloitte.
European Commission, Digital Market Factsheet (2015)
European Parliament. 2016. Towards a thriving data-driven economy, Strasburg.
Forbes Insights. 2015. Betting on Big Data. How the right culture, strategy and investments can help you leapfrog the
competition. Washington.
Fujitsu. 2014. Fujitsu Market Insights. Big Data, Fujitsu Technology Solutions GmbH.
Girard, Greg. From Big Data to Real-Time Business Value, Framingham: IDC Analyze the Future.
Heudecker, Nick, Lisa Kart. 2014. Survey Analysis: Big Data Investment Grows but Deployments Remain Scarce in
2014, Stamford: Gartner.
Heudecker, Nick, Merv Adrian. 2015. Survey Analysis: Hadoop Adoption Drivers and Challenges, Stamford: Gartner.
Heudecker, Nick, Lisa Kart. 2015. Survey Analysis: Practical Challenges Mount as Big Data Moves to Mainstream,
Stamford: Gartner.
Kart, Lisa. 2015. Big Data Industry Insights, Stamford: Gartner.
OpenDataMonitor
Horizon 2020
Coordination and Support Action
GARRI-3-2014 Scientific Information in the Digital Age: Text and Data Mining (TDM)
Project number: 665940
Overcoming legal barriers
FutureTDMReducing Barriers and Increasing Uptake of Text and Data Mining for Research Environments
using a Collaborative Knowledge and Open Information Approach
FutureTDM Workshop II | Brussels | March 29, 2017
Marco Caspers (Institute for Information Law, University of Amsterdam)
Introduction
To discuss:
▪Our work in FutureTDM
▪Policy framework
▪Principles
▪Designing a TDM exception
34FutureTDM
Our work
• Desk research•regulation
•policies
•economy
• Stakeholder engagement•interviews
•workshops
•conferences
• Results:•Barriers to TDM
•Legal & policies
•Skills and education
•Economy
•Technical & infrastructure
•Recommendations!
FutureTDM Policy Framework
• Deliverable D5.1 recommendations cover:•Legal & Policies
•Skills & Education
•Economy & Incentives
•Technical & Infrastructure
• This presentation:•Only legal recos in context of TDM exception
•Underlying research•Legal research into copyright and database law
•Questionnaire on national exceptions
•Response: 16 Member States•Stakeholder engagement
•Views on legal rules
Fundamental principles
37
Scope and design of TDM exception
• From our policy framework:• “Ensure legal rules reflect a fair balance
between the interests of TDM
practitioners and rightsholders (of
copyright and database rights), and
reaffirm that ideas and facts as such are
not protected”
• “Create harmonised and mandatory
rules”
• “Make clear rules” minimise
borderline cases
38
Acts?
Purpose?
Beneficiary?
Overridable?
Subject matter?
Acts
• Reproductions & Extractions• UK (only repro) | EU
• Digital copies• FR
• FutureTDM:• All reproductions and extractions
• No visibility for public• only aggregated information
• No trade on expression
extraction of facts and ideas• No limits to retention
• Reproducibility & verifiability
• Or else: other exception?
39
Acts?
Purpose?
Beneficiary?
Overridable?
Subject-matter?
Purpose
• Scientific research• UK | DSM proposal
• Non-commercial• UK | France
• FutureTDM:• No restriction
• prevent border cases
40
Acts?
Purpose?
Beneficiary?
Overridable?
Subject-matter?
Beneficiary
• Lawful access• UK | FR | DSM proposal
• Research organisation• DSM proposal
• trade-of with ‘non-commercial’
• FutureTDM• Lawful access
• If allowed to read, allowed to mine• trade on ideas, not expression
41
Acts?
Purpose?
Beneficiary?
Overridable?
Subject-matter?
Overridable
• Overridable by contract• Not: UK | FR | DSM proposal
• technical overridability?
• FutureTDM• Not overridable
• Fragmented licensing landscape
• Effectiveness
42
Acts?
Purpose?
Beneficiary?
Overridable?
Subject-matter?
Subject-matter
• No restriction• UK | EU
• Scientific literature• FR
• FutureTDM• No restriction
• Copyright principles apply horizontally
• Uncertainty borderline cases
43
Acts?
Purpose?
Beneficiary?
Overridable?
Subject-matter?
Compensation
• UK | France | EU• No
• Considerations for compensation• purpose: missed out on exploitation of the work (expression)
• no where meant to cover investments (in services)
Measures
• In essence no copyright issue
• Overridability by (technical) security measures?•if yes: may not change status quo
•if no: must achieve fair balance
• How to balance?•voluntary ‘agreed’ best practices?
•service-level obligations?
•transparency?
What’s next?
Want to know more?
• See our Deliverable D5.1 “FutureTDM Policy Framework”
• and upcoming deliverables:•Guidelines for stakeholders (end April)
• You can provide your view in our next session now
FutureTDM Workshop II, 29 March – Any Questions?
Q & A
All slides will be on FutureTDM slideshare after the event
#FutureTDM
47