Rivet Alfresco Solr v7

download Rivet Alfresco Solr v7

of 32

Transcript of Rivet Alfresco Solr v7

  • 8/2/2019 Rivet Alfresco Solr v7

    1/32

    Integrating Apache Solr withAlfresco WCM for Faceted Search

    and Navigation of Next-Generation

    Web Sites

    Vagif Jalilov

    Rivet Logic

  • 8/2/2019 Rivet Alfresco Solr v7

    2/32

    About Rivet Logic Award-winning professional services focused on:

    Enterprise Content Management

    Web Content Management

    Collaboration and Social Communities

    Using Leading Open Source Software

  • 8/2/2019 Rivet Alfresco Solr v7

    3/32

    Business Case for Alfresco & Solr Large scale sites

    Need for real-time updates

    Full-text search

    Faceted search

  • 8/2/2019 Rivet Alfresco Solr v7

    4/32

    Technical Challenges for Search Accurately index each page

    Solution: Assembly of relevant content to index

    Targeted, real-time indexing Solution: Trigger indexing from publishing

    mechanism

  • 8/2/2019 Rivet Alfresco Solr v7

    5/32

    Possible Index Solutions Spidering/Crawling

    Follow navigational & cross-links

    Parse HTML and fetch relevant content Spider full (or partial) site each time

    Real-time Indexing

    Triggered by FSR deployment

    Process only change-set (incremental updates)

    Assemble relevant page content

  • 8/2/2019 Rivet Alfresco Solr v7

    6/32

    Source Control

    Source code & libs

    View templates Site navigation

    Web content

    CMS (Alfresco)

    Binary Content

    Typical Web Application

  • 8/2/2019 Rivet Alfresco Solr v7

    7/32

    Source Control

    Source code & libs

    (View templates)

    CMS (Alfresco)

    Binary Content

    Web Content Site Navigation

    (View templates)

    Managed (Riveted) Web Application

  • 8/2/2019 Rivet Alfresco Solr v7

    8/32

    Page Composition

    Section-

    html.xml

    Related-links.xml

    Supporting-items.xml

    Meta-

    content.xml

    Page-

    metadata.xml

    dynamic

    dynamic

  • 8/2/2019 Rivet Alfresco Solr v7

    9/32

    Content Delivery

    (http://crafterrivet.org)

  • 8/2/2019 Rivet Alfresco Solr v7

    10/32

    Alfresco WCM Lifecycle

  • 8/2/2019 Rivet Alfresco Solr v7

    11/32

    Indexing Architecture

  • 8/2/2019 Rivet Alfresco Solr v7

    12/32

    Solr Customizations

    Custom Solr

    Schema.xml

    Fields (Type, Indexed/Stored) Unique key

    Solrconfig.xml

    dismax type request handler to define queried fields

    ExtractingRequestHandler (indexing RT docs)

  • 8/2/2019 Rivet Alfresco Solr v7

    13/32

    Custom Solr Schema

    page_url

  • 8/2/2019 Rivet Alfresco Solr v7

    14/32

    ExtractingRequestHandlerpage_textpage_title

    ignored_

    ContentStreamUpdateRequest up = new ContentStreamUpdateRequest("/update/extract");

    up.addFile(new File(filePath));SolrServer solrServer = new CommonsHttpSolrServer(solrServerUrl);solrServer.request(up);solrServer.commit();

  • 8/2/2019 Rivet Alfresco Solr v7

    15/32

    Custom RequestHandler

  • 8/2/2019 Rivet Alfresco Solr v7

    16/32

    Compilation

    Compiler Engine processes all instructions

    Dispatches to appropriate Page Type Compiler

  • 8/2/2019 Rivet Alfresco Solr v7

    17/32

    Content Deployment & Solr Update

  • 8/2/2019 Rivet Alfresco Solr v7

    18/32

    Compiler Instructions

  • 8/2/2019 Rivet Alfresco Solr v7

    19/32

    Compilation Types

    1. Web Pages (HTML)

    2. Rich Text (PDF)

  • 8/2/2019 Rivet Alfresco Solr v7

    20/32

    Web Page Compilation & Indexing

    IndexerInstructions

  • 8/2/2019 Rivet Alfresco Solr v7

    21/32

    HTML Indexer Instruction

    /solutions/content-mgmt/overview.htmlIncrease productivity and streamline workflowthroughout the enterprise

    Commercial enterprises and government agenciesface significant challenges as they strive to meet a rapidly growing need tomanage thousands ...

  • 8/2/2019 Rivet Alfresco Solr v7

    22/32

    Rich Text Compilation & Indexing

  • 8/2/2019 Rivet Alfresco Solr v7

    23/32

    Rich Text Indexer Instruction

    Rivet Logic launches Crafter Studio foruser friendly Web content authoring and publishing.

    NewsPress Release2007-12-19T08:00:00Z

  • 8/2/2019 Rivet Alfresco Solr v7

    24/32

    Compiler Configuration

  • 8/2/2019 Rivet Alfresco Solr v7

    25/32

    Compiler Configuration

  • 8/2/2019 Rivet Alfresco Solr v7

    26/32

    Search UI

    Full text search

    Faceted search on category & type

    Pagination or search result clustering

    Keyword highlighting in search results

    Track user queries

  • 8/2/2019 Rivet Alfresco Solr v7

    27/32

    Search Results Page

  • 8/2/2019 Rivet Alfresco Solr v7

    28/32

    Clustered Results

  • 8/2/2019 Rivet Alfresco Solr v7

    29/32

    Summary

    Requirements:

    Real time updates

    Full editorial control

    Faceted search

    Solution

    Alfresco CMS

    Alfresco plugin for Solr indexing Compile updates & index

    Serve in UI (ft search + facets)

  • 8/2/2019 Rivet Alfresco Solr v7

    30/32

    Q & A

    Thank you for attending :-)

    Questions, comments

  • 8/2/2019 Rivet Alfresco Solr v7

    31/32

    Appendix

  • 8/2/2019 Rivet Alfresco Solr v7

    32/32

    Search Model/API