Zarneger Content Preparation and Delivery to Support AI

22
SILVERCHAIR Content Preparation and Delivery to Support Artificial Intelligence Jake Zarnegar Chief Product Officer, Silverchair

Transcript of Zarneger Content Preparation and Delivery to Support AI

Page 1: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

ContentPreparationandDeliverytoSupportArtificialIntelligence

JakeZarnegarChiefProductOfficer,Silverchair

Page 2: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Page 3: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Muchlike“thecloud,”“bigdata,”and“machinelearning”beforeit,theterm“artificialintelligence”hasbeenhijackedbymarketersandadvertisingcopywriters.

Ifthehypeleavesyouasking“WhatisA.I.,really?,”don’tworry,you’renotalone.Iaskedvariousexpertstodefinetheterm andgotdifferentanswers.Theonlythingtheyallseemtoagreeonisthatartificialintelligence isasetoftechnologiesthattry toimitateoraugmenthumanintelligence.

Tome,theemphasisisonaugmentation,inwhich intelligentsoftwarehelpsusinteractanddealwiththeincreasinglydigitalworldwelivein.

OmMalik,TheNewYorkerhttp://www.newyorker.com/business/currency/the-hype-and-hope-of-artificial-intelligence

Page 4: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

GeneralAugmentation

http://www.cnn.com/2017/01/26/health/ai-system-detects-skin-cancer-study/

SpecificAugmentation

TwoTypesofAugmentation

Page 5: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

1:OngoingIndependentLearningFromInteractionwithStimuli

2:ComplexInteractionwithHumans

TwoAI“Augmentation”ConditionsThatMostAgreeOn

Page 6: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

Howcanscientificandscholarlypublishersbestprepareanddelivercontenttoassist(andnothinder)

theadvancementofAIsystems?

Today’sQuestion

Page 7: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

KnowledgeDiscoveryinDatabases(KDD)[1]isdividedinfourmainphases:domainexploration,datapreparation,datamining,andinterpretationofresults.

1. Thefirstphaseisresponsibleforunderstandingtheproblemandwhatdatawillbeusedintheknowledgediscoveryprocess.

2. Thenextphaseselects,cleans,andtransformsthedatatoaformatthatissuitableforaspecificdataminingalgorithm.

3. Inthethirdphase,thechosendataminingalgorithmperformssomeintelligenttechniquestodiscoverpatternsthatcanbeofpotentialuse.

4. Thelastphaseisresponsibleformanipulatingtheextractedpatternstogenerateinterpretableknowledgeforhumans…

Page 8: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

…Mostoftheresearchcarriedoutinthisareafocusonthedataminingphase,whichusesartificialintelligencealgorithmslikedecisiontrees,artificialneuralnetworks,evolutionarycomputation,amongothers[2]todiscoverknowledge.Ontheotherhand, thedatapreparationphase,responsibleforintegration,cleaning,andtransformationofdata,hasnotbeenthesubjectofmuchresearch.Infact,Pyle[3]arguesthat “datapreparationconsumes60to90%ofthetimeneededtominedata– andcontributes75to90%totheminingproject’ssuccess”.

From: PauloM.Goncalves Jr.and RobertoS.M.Barros, "AutomatingDataPreprocessingwithDMPMLandKDDML," 10thIEEE/ACISInternationalConferenceonComputerandInformationScience,2011, DOI:10.1109/ICIS.2011.23.

Page 9: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

“Idownloaded2TBofArxiv contentlastweekbutI can’tbringmyselftoopenitandstartworkingonanalyzingitbecauseIknowIhaveatleast6monthsofpainstakingdatacleanup&preparationaheadofmebeforeIcanbegin.”

--MikeM.,FastForwardLabs

Page 10: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

WhereWeAreNow:ApplyingtheLevelsofCognitiveLearningtoSoftware

Page 11: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR Bloom, et al. 1956

Page 12: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• We’vemasteredthis!• Thefundamentalsofthe

permanentscholarlyrecord(DOI,CLOCKSS,PDF,etc.)

Page 13: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Alsostrongincreatinginterfacesthatassistunderstandingfromhumanreaders

Page 14: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

FosteringSoftwareUnderstanding

Insomewayswe'vegotagoodfoundation– detailed,consistentcontenttaggingtoaidwithsoftwareUnderstanding

• StructureunderstandingthroughnormalizedXML:whatisthetitle,authors,abstract, wheretheconclusionsareinthepaper,etc.

Increasedtaggingofnamedentities:understandingwhatisagene,whatisaclinicaltrialID,whatisaperson

• Thiscanstillgoawry: "BethIsrael,""BethIsraelDeaconess"examples

Page 15: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

http://anesthesiology.pubs.asahq.org/article.aspx?articleid=2592740

https://academic.oup.com/rheumatology/article/doi/10.1093/rheumatology/kex082/3101351/Musculoskeletal-manifestations-of-Ebola-virus

Page 16: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

1:Interfacesstillprimarilyvisual,narrative

2:HelpfulunderlyingXMLstructurenotshared

3:Littletonotaggingabove“Understanding”level

3ObstaclestoHigherSoftwareCognition

Page 17: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

WhereWe’reGoing:TheRacetotheTop

Page 18: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR Bloom, et al. 1956

Page 19: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Full-textnormalizedXML(orJSON)• Separateproduct/subscriptionforsale• Separatedeliverymechanism(nohumaninterface)butcan

piggybackonexistingcontentworkflows• Accessesanewclassofcustomerw/deeppockets

(AIcreatorsorimplementers)• Requiresnewvetting/legalagreements

ConsiderProvidingYourStructuredContentasaNewProduct

Page 20: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

…Mostoftheresearchcarriedoutinthisareafocusonthedataminingphase,whichusesartificialintelligencealgorithmslikedecisiontrees,artificialneuralnetworks,evolutionarycomputation,amongothers[2]todiscoverknowledge.Ontheotherhand, thedatapreparationphase,responsibleforintegration,cleaning,andtransformationofdata,hasnotbeenthesubjectofmuchresearch.Infact,Pyle[3]arguesthat “datapreparationconsumes60to90%ofthetimeneededtominedata– andcontributes75to90%totheminingproject’ssuccess”.

From: PauloM.Goncalves Jr.and RobertoS.M.Barros, "AutomatingDataPreprocessingwithDMPMLandKDDML," 10thIEEE/ACISInternationalConferenceonComputerandInformationScience,2011, DOI:10.1109/ICIS.2011.23.

Page 21: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

• Developyourownsoftware(ordevelopanalysis,applicationandevaluation)higherupthecognitionpyramid

• Ifthat’sthecase,don’tshareyourstructuredcontentwithpotentialcompetitors

OrConsiderCompetingDirectly!

Page 22: Zarneger Content Preparation and Delivery to Support AI

SILVERCHAIR

ThankYou

JakeZarnegarChiefProductOfficer,Silverchair