Web Of Books

29
Peter Brantley Tools of Change Internet Archive Frankfurt, Germany The Presidio 10.09

description

Presentation at Tools of Change Frankfurt introducing the BookServer architecture. Brief description of history, motivations, and technical outline.

Transcript of Web Of Books

Page 1: Web Of Books

Peter  Brantley      Tools  of  Change  Internet  Archive      Frankfurt,  Germany  The  Presidio      10.09  

Page 2: Web Of Books

 Entering  the  digital  fold,    a  tangled  landscape:  

1.  finding  the  book  2.  format  of  the  book  3.  acquiring  the  book  

Page 3: Web Of Books

 Digital  channels  are  fragmented  ...    

  web  search?    (Google,  Bing,  etc)    publisher  site?  (tor.com  ...  )    the  local  library?    (borrowing/lending)    online  bookstore?  (Amazon,  Indigo)      alt.  vendor?  (Smashwords,  Shortcovers)  

Page 4: Web Of Books

 What  is  the  reader  getting?  

  highly  structured  display  (PDF)    downloadable  book  (EPUB,  MOBI)    cloud-­‐based    (EPUB  >  HTML,  Flash)    not  really  available  at  all    (biblio  data,  ILL)    

Page 5: Web Of Books

Plethora  of  devices  –    

  iPhone  |  Android      Sony  Reader  |  iRex  Illiad  |  BeBook  |  Bookeen            Plastic  Logic  |  Amazon  Kindle  reader  device  

  traditional  laptop    game  console  (Wii)      near-­‐mythical  Apple  Tablet    

Page 6: Web Of Books

 +  Device      +  Format      +  Discovery      +  Acquisition    +  Installation  

(  +  DRM  )          =      CONFUSION.  

http://www.flickr.com/photos/dan4th/2295925353/  

Page 7: Web Of Books

What  readers  want  to  have  ..  

Be  able  to  find  the  books  they  want,  in  the  formats  that  they  can  use,    for  the  device  that  they  have,  

and  not  have  it  be  painful.    

Page 8: Web Of Books

 What  publishers,  libraries,  bookstores  want  -­‐  

 Make  books  available  for  discovery,    with  accurate  descriptive  information,    at  as  many  different  places  as  possible,    under  the  sales  /  use  terms  permitted.  

Page 9: Web Of Books
Page 10: Web Of Books

 Even  the  U.S.  Department  of  Justice  is  an  advocate:  

 “[book]  data  provided  should  be  available  in  multiple,  standard,  open  formats  supported  by  a  wide  variety  of  different  applications,  devices,  and  screens.”  

Page 11: Web Of Books
Page 12: Web Of Books

 Creating  a  new  architecture  using  common,  open  standards  that  permits  people  to  find,  buy,  acquire,  and  read  books  from  any  source,  on  any  device,  using  many  different  ebook  applications.      

Page 13: Web Of Books
Page 14: Web Of Books

 Library  2.0  Gang  (02/09):    Google  books  and  libraries  

   various  email  discussions  of  nascent      “Open  Catalogue  Crawling  Protocol”    

 Google,  DLF,  Talis,  and  others    Atom  vs  Sitemap  discussions  

Page 15: Web Of Books

IDPF  Board      conference  calls  

Tools  of  Change  (NYC,  Feb  2009)    hallway  conversations  

Web  Expo  2.0  (SF,  Apr  2009)    pinot  noir    

Page 16: Web Of Books

 “The  Open  Publication  Distribution  System  (OPDS)  is  a  generalization  of  the  Atom  [XML]  approach  used  by  Stanza's  online  catalog.    ...    I  believe  this  effort  has  the  potential  to  be  a  critical  enabler  to  the  growth  in  access  to,  and  adoption  of,  digital  books.”    

   -­‐  Bill  McCoy,  Adobe,  04.09  

Page 17: Web Of Books

“BookServer”  is  the  architecture.    

“OPDS”  is  the  technical  specification.  

“Catalogs”  are  made  using  OPDS.  

“Atom”  is  the  XML  scheme  for  OPDS.  

Page 18: Web Of Books

 Because  OPDS  is  based  on  a  commonly      used  XML  standard,  called  Atom  –        

 OPDS  Catalogs  can  be  rendered  or  read  by  –  

  web  browsers      news  readers  (rss)   mobile  applications  

Page 19: Web Of Books

Because  Catalogs  are  easy  to  make  –    

  any  web  site  can  run  a  bookstore.    libraries,  bookstores,  publishers  can  play.    search  engines  can  serve  as  book  gateways.    aggregators  (IA,  Ingram,  etc.)  can  harvest  multiple  catalogs.  

Page 20: Web Of Books

 Because  Catalogs  contain  simple  data  describing  books  and  their  availability  –  

   Catalogs  can  also  be  used  for  B2B,  to  distribute  data  to  partners  for  “harvest”  instead  of  using  complicated  standards.  

 (Future:  “real  time  web”  notifications.)  

Page 21: Web Of Books

Catalogs  provide  manifests  –  

  list  of  the  titles  available    information  about  each  title    formats  the  title  is  available  in    ways  the  title  can  be  acquired    

Page 22: Web Of Books

A  reader  ...    

1.  browses  a  Catalog  of  titles  2.  selects  a  title  for  more  information  3.  makes  a  purchase/borrow  decision  4.  obtains  the  book  (PayPal,  Amazon,  etc.)  5.  installs  and  reads  the  book.  

Page 23: Web Of Books

 Catalogs  can  be  derived  from  basic  bibliographic  metadata.    Such  as:    

 ONIX,  MARC,  (ahem)  spreadsheets  

 Internally  OPDS  Catalogs  use    simple  Dublin  Core  metadata    to  describe  the  titles  offered.  

Page 24: Web Of Books

ONIX  (and  BISG  “BookDROP”)  are:  

    designed  for  a  different  use  cases      complex  standard  with  many  options      not  widely  used  beyond  publishing        not  understood  by  web  browsers        established;  change  is  difficult    

Page 25: Web Of Books

 Because  we  use  open  standards  for  describing  data,  it  is  possible  to  link  bibliographic  book  data  more  easily.  

Page 26: Web Of Books

Catalogs  could  tie  together  –    

  book  reviews    reading  lists    annotations    fan  fiction  

  etc.  

Page 27: Web Of Books

A  workshop  sponsored  by  the  Internet  Archive  October  19-­‐20,  Fort  Mason  San  Francisco,  California  

With  the  assistance  (among  many  others):        O’Reilly  Media    http://oreilly.com/  

    Threepress  http://threepress.org/      Feedbooks  http://feedbooks.com/  

    Book  Oven    http://bookoven.com/  

Page 28: Web Of Books

  Adobe    Aldiko      (Amazon)  Lexcycle    Applewood  Books    Book  Oven    Booki    Feedbooks    HumanWare  

  Ingram  Digital    O’Reilly  Media    OLPC    Pixel  Qi    Shortcovers    Threepress    ...  psst  ...  

  and  others    

Page 29: Web Of Books

Contact  information:  

peter  brantley      internet  archive  @naypinya  (twitter)      peter  @  archive.org  

keith  fahlgren        o’reilly  media    @abdelazer  (twitter)    keith  @  oreilly.com