Marketplace Overview: Text Analytics Vendor Options Nick Patience

Post on 28-Nov-2014

Marketplace Overview: Text Analytics Vendor Options

Nick Patience

Research Director, Information Management

The 451 Group

Choosing a vendor: things to consider

► YOUR REQUIREMENTS

• Corpus size and growth
• Scalability
• On-site vs. SaaS
• Languages
• Interoperability with existing systems
• Compatibility with future tech (OWL, RDF, XML, etc.)

Choosing a vendor: things to consider

► VENDOR ISSUES

• Are there other customers in your specialty?
• Viability of vendor
• Willingness for pilot or proof of concept
• Help available for configuration and installation

Issues affecting the text analytics market

1. Mergers and acquisitions

2. Regulatory mandates (FRCP, SarbOx)

3. On-premise licensing or SaaS

4. Economic uncertainty – discretionary or must-have?

5. Does a market even exist?

M&A to Date

Target        Acquirer           Deal Value   When    Why?
FAST          Microsoft          $1.24b       1/08    Boost SharePoint search
ClearForest   Reuters            $25M*        4/07    Customer buying supplier
Inxight       Business Objects   $76M         5/07    Need to understand text
Stratify      Iron Mountain      $158M        10/07   E-discovery
Teragram      SAS Institute      $10-15m*     3/08    Acquire own text analytics

*451 estimate; Source: 451 M&A KnowledgeBase

Business Drivers

► eDiscovery

December 1, 2006: effective date of the Electronic Discovery Amendments to the Federal Rules of Civil Procedure

Business Drivers

►Electronic Publishing

“There will be no media consumption left in ten years that is not delivered over an IP network. There will be no newspapers, no magazines that are delivered in paper form. Everything gets delivered in an electronic form.”

Steve Ballmer, CEO Microsoft, June 6, 2008

Business Drivers

► Security / Fraud Detection / Risk Mgmt.

Market Map – End of 2008

Govt/Military Intelligence: SAP [Inxight], IBM Cognos, SAS, SPSS, Attensity, Autonomy, Infonic

Pharma & Life Sciences: Temis, IBM, SPSS

Early Warning (Mfg): SPSS, SAS, Attensity

Banking & Insurance: IBM, SAS, SPSS, Autonomy, FAST, Megaputer

Media & Publishing: FAST, Temis, Nstein, Infonic, Autonomy, ClearForest, Lexalytics

Market Map – End of 2008, cont.

Market Research & Surveys: SPSS, SAS

Customer Analytics: SAS, SPSS, Autonomy, Attensity, IXReveal

Business Intelligence: SAP [Inxight], Cognos, Clarabridge

Security: Autonomy, IBM, Cyveillance

General OEM: Basis, Lexalytics, SAS [Teragram]

Records Management: IBM

the Attensity Text Analytics suite
► Auto-categorization
► Anaphora resolution
► Output in OWL ontology language
► Statistical extraction
► “Exhaustive extraction”
► Targeted extraction

Founded: 2000
Base: Palo Alto, CA
Funding: $28m venture capital

Key verticals:
► Law enforcement
► Travel and hospitality
► Intelligence
► Customer analytics

Key Customers:
► Travelocity
► Whirlpool
► JetBlue

• SaaS
• On premise: Windows, Linux
► Avg deal size: $250,000 before services

IDOL Server 7
► “Conceptual retrieval”
► Automatic taxonomy generation
► Automatic categorization

Founded: 1996
Base: Cambridge, UK
Funding: Public

Key verticals:
► Banking and insurance
► Media and publishing
► Law
► Government / military intelligence
► Security
► Customer analytics

Key Customers:
► Halliburton
► Gillette
► Standard & Poor's
► Cisco
► Sony

• SaaS: XXXX
• On premise: Windows, Linux, Solaris, AIX
► Avg deal size: not available

Rosette Linguistics Platform
► Name matching
► Name translation
► Multilingual text analytics
► Language identification
► Entity extraction

Founded: 1995
Base: Cambridge, MA
Funding: < $10 million, In-Q-Tel

Key verticals:
► General OEM
► Government / military intelligence
► Commercial search engines

Key Customers:
► HP
► Yahoo
► Siebel
► FAST
► Google
► Oracle

• SaaS: XXXXXXX
• On premise: Windows, Linux, Solaris,
► Avg deal size: $250-300,000

BusinessObjects Text Analysis
► Entity extraction
► Document-level classification
► Document summarization
► 32 languages
► Segmentation
► Stemming
► Part-of-speech tagging

Founded: 1972
Base: Walldorf, Germany
Funding: Public

Key verticals:
► Business intelligence
► Government & military intelligence

Key Customers:
► Federal Agencies (DOA, DAA, DHS)
► OEM: SAS, IBM, Oracle

• SaaS: XXXXXX
• On premise: Windows, Solaris, Linux, HP-UX, UIX
► Avg deal size: $$$$

Content Mining Platform
► BI-tool friendly
► Categorization

Founded: 2005
Base: Reston, VA
Funding: $10.2m, venture capital

Key verticals:
► Business intelligence

Key Customers:
► Gaylord Hotels
► Intuit
► H&R Block

• SaaS
• On-premise: ???????
► Avg deal size: $150-300,000, $10,000 / month SaaS

Calais
► Entity, fact and event extraction
► Packaged extraction modules
► Statistical and semantic tagging
► Tagging concepts
► Categorization
► Semantic tagging

Founded: 1998
Base: Waltham, MA
Funding: Public

Key verticals:
► Media and publishing

Key Customers:
► Air Force
► Elsevier
► Dow Jones

• SaaS: available
• On premise
► Avg deal size: Not available

FAST ESP
► Thesaurus
► Phrase detection
► Spell-checking
► Anti-phrasing
► Language detection
► Lemmatization
► Synonyms

Founded: 1997
Base: Needham, MA
Funding: MSFT, public

Key verticals:
► Media & publishing
► Banking & insurance

Key Customers:
► Autotrader.com
► WeightWatchers.com
► National Instruments

• SaaS: XXXXXXX
• On premise: Windows, Linux, HP-UX, Solaris, UIX
► Avg deal size: $$$

OmniFind Analytics Edition
► Keyword search
► Semantic search
► Drill down search
► Trend analysis
► Delta analysis
► Automated alerting

Founded: 1889
Base: Armonk, NY
Funding: Public

Key verticals:
► Security
► Records management
► Banking & insurance
► Military / govt intelligence
► Pharma & life sciences

Key Customers:
► Large financial data provider
► Large Japanese auto manufacturer
► Large Japanese telco provider

• SaaS: XXXXX
• On premise: Windows, AIX, Linux
► Avg deal size: $$$$

Sentiment
► Sentiment analysis of print media

Founded: 2000
Base: London, UK
Funding: Public

Key verticals:
► Media and publishing

Key Customers:
► Dow Jones Factiva
► Thomson Reuters

• SaaS: XXXXXXXX
• On premise: Windows
► Avg deal size: Not available

uReveal
► Categorization – Bayesian, SVD, keyword, concept search
► Clustering
► Classification
► Concept extraction
► Thesaurus
► Relationship discovery

Founded: 2000
Base: Jacksonville, FL
Funding: Private

Key verticals:
► Security
► Law enforcement

Key Customers:
► Fireman’s Fund
► Jacksonville Sheriff’s Office

• SaaS: available
• On premise: Windows
► Avg deal size: $

Salience Engine w/ Sentiment Toolkit
► Sentiment extraction
► Tailored sentiment toolkit
► Entity extraction
► Entity relationships
► Document summarization

Founded: 2003
Base: Amherst, MA
Funding: Private

Key verticals:
► Marketing & surveys
► Media & publishing

Key Customers:
► SmartBrief
► Cisco Systems
► FT.com
► Cymfony

• SaaS: XXXXX
• On premise: Windows
► Avg deal size: $125-150,000

PolyAnalyst
► Taxonomy-based categorization
► Taxonomy creation
► Entity extraction
► Clustering

Founded: 1997
Base: Bloomington, IN
Funding: Private

Key verticals:
► Pharmaceuticals
► Insurance
► Defense
► Aviation

Key Customers:
► DVA
► FAA
► Ernst & Young
► Pfizer

• SaaS: XXXXXX
• On premise: Windows
► Avg deal size: $300,000

Text Mining Engine (TME)
► Optional summarizer
► Sentiment analysis engine
► Automated entity extraction
► Categorizer
► Concept extraction
► Taxonomy management

Founded: 2001
Base: Montreal, Quebec
Funding: Public

Key verticals:
► Media & publishing

Key Customers:
► Reader’s Digest
► Time, Inc.
► Le Monde
► Conde Nast
► Reed Business

• SaaS: XXXXX
• On premise: Windows, Linux
► Avg deal size: $750,000

SAS Text Miner
► Clustering
► POS tagging
► Concept extraction
► Multiple languages
► Stemming
► Multi-lingual
► Entity extraction

Founded: 1976
Base: Cary, NC
Funding: Private

Key verticals:
► Customer analytics
► General OEM
► Market research & surveys
► Government / military
► Early warning (mfg)
► Banking & insurance

Key Customers:
► Eli Lilly
► Department of the Treasury
► Ford
► Pitney Bowes

• SaaS: XXXXX
• On premise: Windows, Solaris, AIX
► Avg deal size: $200-300,000 (Inxight 2005)

Clementine 12
► Recency, frequency and monetary
► Survival analysis
► Support Vector Machines algorithms
► Bayesian Networks algorithms
► Multi-lingual sentiment analysis

Founded: 1968
Base: Chicago, Illinois
Funding: Public

Key verticals:
► Customer analytics
► Early warning (mfg)
► Banking & insurance
► Market research & surveys
► Govt / military intelligence
► Pharma & life sciences

Key Customers:
► Fortune 500

• SaaS: XXXXX
• On premise: Windows, Linux, Solaris, HP-UX, IBM AIX
► Avg deal size: $$$

Luxid
► Concept-based searching
► Keyword searching
► Entity extraction
► Categorization
► Information clustering

Founded: 2000
Base: Paris, FR
Funding: €7m, private equity

Key verticals:
► Industrial
► Govt / military intelligence
► Pharma & life sciences

Key Customers:
► BASF
► Novartis
► Pfizer

• SaaS: Hosted version available
• On premise: Windows, Linux
► Avg deal size: €3,000-10,000 per user per year. On-premise version is priced on a per-CPU basis and typically costs €200,000-300,000

Viziant 1.0
► Entity extraction and stemming
► Classification
► Discovery
► Base set of 10 taxonomies
► Statistical and NLP techniques
► “Frame of Reference”

Founded: 2003
Base: McLean, VA
Funding: Private - undisclosed

Key verticals:
► Govt / military intelligence

Key Customers:
► Federal agencies

• SaaS: XXXXX
• On premise: Linux
► Avg deal size: ????????

Sentiment Analysis

► Andiamo Systems

► Biz360, a veteran of the space

► BrandIntel

► Buzzlogic, a recent startup

► Collective Intellect – about a year old

► Jodange – media-based opinion tracking for chosen topics or influencers

► Monitor110, aimed at institutional investors

► MotiveQuest, tweaks its linguistic model depending on the domain being analyzed

► Nielsen Media Research's BuzzMetrics – the 800-pound gorilla that rolled up some of the early players

► Northern Light - veteran search company, with its MI Analyst sentiment analysis product

► Perception Metrics, claims to be able to do phrase-level sentiment analysis, aimed at PR and marketing professionals

► RavenPack International, counts Dow Jones & Company as a partner

► SAS – offers the service

► SPSS – offers the service

► Sentiment Metrics, a British-based brand monitoring company

► SentiMetrix – still in stealth, apparently

► ScoutLabs, is in beta and uses Lexalytics technology

► SkyGrid, aggregates and analyzes financial news

► Summize, analyzes online product reviews for sentiment

► Umbria, focused on online sentiment analysis of social media, such as blogs

Questions?

nick.patience@the451group.com

Nick Patience

Research Director, Information Management

http://blogs.the451group.com/information_management/