Text Segmentation Ppt Nitheesha

download Text Segmentation Ppt Nitheesha

of 26

Transcript of Text Segmentation Ppt Nitheesha

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    1/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    2/26

    SEMINAR BY:

    NITHEESHA.S (1JB14LVS07)

    UNDER THE GUIDENCE OF:

    SOWMYA B.J

    DEPT OF ECE

    SJBIT BANGALORE

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    3/26

    With the wide use of networked equipment such as computers,

    scanners, printers and copiers, it has become more important

    to efficiently compress, store, and transfer large document

    files.

    For example, a typical color document scanned at 300 dpirequires approximately 24Mbytes of storage without

    compression .

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    4/26

    Fig 1: MRC document compression standard mode 1 structure.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    5/26

    Segmentation method is composed of two algorithms

    that are applied in sequence:

    The cost optimized segmentation (COS)algorithm

    The connected component classification (CCC) algorithm.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    6/26

    The COS algorithm is a block-based segmentation algorithm

    based upon cost optimization.

    The COS produces a binary image from a gray level or color

    document; however, the resulting binary image typically

    contains many false text detections.

    The COS algorithms is comprised of two components:

    Block wise segmentation and Global segmentation

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    7/26

    Fig 2: COS algorithm comprises two steps: blockwise segmentation and global

    segmentation. The parameters of the cost function used in the global segmentation

    are optimized in an offline training procedure.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    8/26

    Fig 3. Illustration of a blockwise segmentation. The pixels in each block are separated

    into foreground (1)or background (0)by comparing each pixel with a threshold t.

    The threshold t is then selected to minimize the total subclass variance.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    9/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    10/26

    Fig. b. Illustration of how thecomponent inversion step cancorrect erroneous segmentations

    of text. (a) Original documentbefore segmentation. (b) Resultof COS binary segmentation. (c)Corrected segmentation aftercomponentinversion.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    11/26

    The CCC algorithm further processes the resulting binary

    image to improve the accuracy of the segmentation.

    It does this by detecting nontext components (i.e., false text

    detections) in a Bayesian framework which incorporates an

    Markov random field (MRF) model of the component labels.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    12/26

    The CCC algorithm proceeds in three steps:

    connected component extraction,

    component inversion,

    component classification

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    13/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    14/26

    Fig. c. Illustration of a multiscale-COS/CCC algorithm. Segmentation progresses from

    coarse to fine scales, incorporating the segmentation result from the previous coarser scale.

    Both COS and CCC are performed on each scale, however only COS was adapted to the

    multiscale scheme.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    15/26

    A. Preprocessing

    For consistency all scanner outputs were converted to sRGB

    color coordinates [49] and descreened before segmentation . The scanned

    RGB values were first converted to an intermediate device-independent colorspace, CIE XYZ, then transformed to sRGB .

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    16/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    17/26

    Fig. e. Text regions in the binary mask. The region is 165 370 pixels at 400 dpi, whichcorresponds to 1.04 cm 2.34 cm. (a) Original test image. (b) Ground truthsegmentation. (c) Otsu/CCC. (d) Multiscale-COS/CCC/Zheng. (e) DjVu. (f) LuraDocument.(g) COS. (h) COS/CCC. (i) Multiscale-COS/CCC.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    18/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    19/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    20/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    21/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    22/26

    We presented a novel segmentation algorithm for the compression of rasterdocuments.

    While the COS algorithm generates consistent initial segmentations,.

    The CCC algorithm substantially reduces false detections through the use of acomponent-wise MRF context model.

    We showed that the multiscale-COS/CCC algorithm achieves greater text detectionaccuracy with a lower false detection rate, as compared to state-of-the-art commercial

    MRC products. Such text-only segmentations are also potentially useful for documentprocessing applications such as OCR.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    23/26

    [1] ITU-T Recommendation T.44 Mixed Raster Content (MRC), T.44, InternationalTelecommunication Union, 1999.

    [2] G. Nagy, S. Seth, and M. Viswanathan, A prototype document image

    analysis system for technical journals, Computer, vol. 25, no. 7, pp.1022, 1992.

    [3] K. Y.Wong and F. M.Wahl, Document analysis system, IBM J. Res.Develop., vol. 26, pp. 647656, 1982.

    [4] J. Fisher, A rule-based system for document image segmentation, inProc. 10th Int. Conf. Pattern Recognit., 1990, pp. 567572.

    [5] L. OGorman, The document spectrum for page layout analysis,IEEE Trans. Pattern Anal. Mach. Intell., vol. 15, no. 11, pp.

    11621173, Nov. 1993.

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    24/26

    [6] Y. Chen and B. Wu, A multi-plane approach for text segmentationof complex document images, Pattern Recognit., vol. 42, no. 7, pp.14191444, 2009.

    [7] R. C. Gonzalez and R. E. Woods, Digital Image Processing, 3rd ed.

    Upper Saddle River, NJ: Pearson Education, 2008.

    [8] F. Shafait, D. Keysers, and T. Breuel, Performance evaluation andbenchmarking of six-page segmentation algorithms, IEEE Trans. PatternAnal. Mach. Intell., vol. 30, no. 6, pp. 941954, Jun. 2008.

    [9] K. Jung, K. Kim, and A. K. Jain, Text information extraction in images

    and video: A survey, Pattern Recognit., vol. 37, no. 5, pp. 977

    997,2004.

    [10] G. Nagy, Twenty years of document image analysis in PAMI, IEEETrans. Pattern Anal. Mach. Intell., vol. 22, no. 1, pp. 3862, Jan. 2000

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    25/26

  • 8/10/2019 Text Segmentation Ppt Nitheesha

    26/26