LIMB™ Processing Release Notes - i2S › upload › limbprocessing › LIMB_Release_Notes4.… ·...

14
February 5, 2018 LIMB™ Release Notes 1 LIMB™ Processing Release Notes LIMB 4.3.0 WHAT’s NEW! 06/02/2018 Features - New Page detection module for better performance. - New PDF input management of text PDF. - New PDF input management of multiple layer PDF. - New Tesseract OCR Engine available. - Add MAG export format. - Add Sampling report to the export list. - Add LibSafe connection to the publishing step. - Add Samba share protocol to the publishing step. Improvements - Updated restart services script from notifier. - Feed back to support from Limb UI. Bug Fixes - Fix curve correction process on 600 dpi images. - Fix auto table of content feature when using prefix in page numbering. - Fix specific characters (spanish) management in metadata import from .mrc file. - Fix bug where bad values from database makes app crash. - Fix bug where resize process transform greyscale image to color. - Fix bug on Gamma correction value modification. - Fix outside and edge frame margin error with double pages and pixel unit. - Fix bug where delink between spine and outside edge is not saved. - Fix bug with date time error at startup in some international languages like German. - Fix error in XMP values for PDF and JPEG. - Various crashes and bugs were fixed.

Transcript of LIMB™ Processing Release Notes - i2S › upload › limbprocessing › LIMB_Release_Notes4.… ·...

  • February 5, 2018

    LIMB™ Release Notes 1

    LIMB™ Processing Release Notes

    LIMB 4.3.0 WHAT’s NEW! 06/02/2018

    Features - New Page detection module for better performance. - New PDF input management of text PDF. - New PDF input management of multiple layer PDF. - New Tesseract OCR Engine available. - Add MAG export format. - Add Sampling report to the export list. - Add LibSafe connection to the publishing step. - Add Samba share protocol to the publishing step.

    Improvements - Updated restart services script from notifier. - Feed back to support from Limb UI.

    Bug Fixes - Fix curve correction process on 600 dpi images. - Fix auto table of content feature when using prefix in page numbering. - Fix specific characters (spanish) management in metadata import from .mrc file. - Fix bug where bad values from database makes app crash. - Fix bug where resize process transform greyscale image to color. - Fix bug on Gamma correction value modification. - Fix outside and edge frame margin error with double pages and pixel unit. - Fix bug where delink between spine and outside edge is not saved. - Fix bug with date time error at startup in some international languages like German. - Fix error in XMP values for PDF and JPEG. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 2

    LIMB 4.2.0 WHAT’s NEW! 15/04/2017

    Features - New block ordering management in segmentation module. - Add a filter block selection based on minimum surface in segmentation module. - Add an automatic block ordering function in segmentation module. - Add a manual block ordering function with mouse over in segmentation module. - New export option to output the same document in RGB, Greyscale and BW. - Updated ALTO output with ALTO LOC v3.1. - New automatic quality control for images with low resolution. - New thumbnail removal tool for imported TIFF files. - Add QC report and sampling report information to the CSV output. - New ABBYY v11 package. - Updated documentation for version 4.2.

    Improvements - Image resize can be done with or without original ratio respect. - Change margin values to decimal for mm and inches. - Change OCR statistics output from CSV to XLSX. - Image padding process can be done with %. - Manual pagination Improvement. - Improve processing time for ABBYY v11. - Add new dongle Codemeter drivers to ABBYYv11 package. - Improve process management on Left/Right page status switch in QC. - Improve connection settings between Limb Processing and Limb Gallery. - Change default thumbnail size in UI for better selection. - Change process list box size in UI for better display.

    Bug Fixes - Fix the crash on erasing the last available frame. - Fix the manual custom fields modification that was not persistent. - Fix OCR statistics output error. - Fix the resolution information after image resize in %. - Fix user rights management on Server version. - Fix the export of image custom fields metadatas. - Fix the simultaneous export of Raw XML and CSV. - Fix the default resolution information read for JP2K files (72 dpi -> 300 DPI). - Fix the mixed import of multiple page and single page files. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 3

    LIMB 4.1.0 WHAT’s NEW! 15/12/2016

    Features - Advanced manual selection process. - New connection to metadata server through HTTP. - Page detection process can be done on custom color background based on HSL values.

    Improvements - Multicore management on batch processing and OCR. - JP2K output is now based on OpenJpeg library. - Add image list defects to the QC report export format. - Default method for resize process is now “Cubic”.

    Bug Fixes - Fix JP2K export slowness with greyscale images. - Fix OCR result incorrect position of ":" in arabic language. - Fix ICC profile usage that decrease image resolution to 72 dpi. - Fix automatic pages detection where right pages are detected as left pages. - Fix copy and paste in custom fields that cut off text to 255 characters. - Fix curvature correction that is not applied on image due to missing VC redist in the installer. - Fix parallel jobs during OCR step that do not reach the maximum number of cores. - Fix issue with publishing TOC files to Gallery that transfer 2 TOC files. - Fix issue with mixed output that do not export TIFF G4 due to temporary files. - Fix image conversion to B&W that keep image depth to 8 bits. - Fix issue with multiple PDF import and mix of pages order. - Fix PDF exports error when processing large batches. - Fix missing image resolution with JP2K output. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 4

    LIMB 4.0.0 WHAT’s NEW! 30/06/2016

    Features - Civil Register Module (as an option). - Advanced Custom Fields (as an option).

    - Metadata entry window construction based on an XML file. - New XML and HTML output for document and image custom fields - Automatic structure completion based on image tags. - Improve connection between Limb Processing and Limb Gallery: - Add "Publish" option. - Choose Limb Gallery Template, language and item type to import metadata. - Choose several format to publish - New Raw XML output for document metadata. - Apply XSLT stylesheet to RAW XML output.

    Improvements - Full screen mode window in QC step. - Add a shortcut (CTRL+D) for the erase inside process. - Add information about all selected language in the OCR settings step. - Add option to TOC file for adding or not the physical page number. - METS output : - Add SHA-1, CRC32, MD5 checksum in METS. - Add tech MD on derivatives in METS. - Add JP2 creation date in METS. - Add error message when OCR counter is empty. - Select between two ABBYY dongle. - Client lite installer for the server version. - Center process has new option to disable detection

    Bug Fixes - Fix some auto Pagination crash in page validation step. - Fix some auto pagination issue with large books over 1000 pages that do not give results. - Fix translation for Segmentation module buttons. - Fix inverted word display in OCR QC for Arabic document with IRIS. - Fix PDF import into Hot Folder that do not start. - Fix issue with multiple single page PDF files during import step. - Fix search attribute from the import catalog method that is not saved. - Fix ALTO File extension that is not correct in publishing step (Limb Gallery). - Fix missing for JP2 in METS. - Fix segmentation module crash with specific document. - Fix OCR User Dictionary that is not saved with the template. - Fix Segmentation module low resolution display. - Fix manual pagination window crash if user remove the step value. - Fix error with Working Folder temporary directory creation. - Fix "filename" value from ALTO file is different from image filename. - Fix the only 4 languages available with ABBYY ocr on client computer with server version. - Fix the crash when project is empty and adding an image in the QC step. - Fix synchronization issue with processed image display after some QC processes. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 5

    LIMB 3.3.2 WHAT’s NEW! 15/01/2016

    Features - Automatically extract Bookmarks from imported PDF and retrieve the information into the structuring. - Customization of folio pagination. Added prefix and suffix for left and right pages. - Added customization of the search attribute for the Z39.50 protocol.

    Improvements - Added full image resolution in structuring step viewer. - Page detection process improvement for dark images. - Improvement of ALTO output in order to pass XSD validation.

    Bug Fixes - Fix the issue with two exported METS files in different location. - Fix missing XML header for exported DC file. - Fix error in MRC entries with some fields. - Fix the crash issue that create a never ending process. - Various crashes and bugs were fixed.

    LIMB 3.3.1 WHAT’s NEW! 15/10/2015

    Improvements - Manage case sensitive metadata server on OAI protocol. - Modify software version to ALTO file - Added full image resolution in document validation viewer.

    Bug Fixes - Fix the issue with only 5 languages available in ABBYY OCR. - Fix the Segmentation module restricted access for desktop users. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 6

    LIMB 3.3 WHAT’s NEW! 21/09/2015

    Features - New custom folio pagination available into the page validation tool. - Physical page number has been added to the table of content (TOC) output. - Import step can manage automatic metadata with CSV and manual entries at the same time.

    Improvements - OCR engine name has been added to the OCR statistics output. - New keyboard shortcuts for segmentation module to add text zone. - Import step detects 0 kb files to improve speed process. - Spanish characters display has been improved into the MARC record automatic retrieval form. - Added .mrc format to automatic imported metadata files.

    Bug Fixes - ABBYY OCR available languages are updated from the license and not from the template. - Fix bug where project with the same name will not go into the segmentation module. - Fix segmentation shortcuts not available. - Fix bug with missing text entry into METS file output. - Various crashes and bugs were fixed.

  • February 5, 2018

    LIMB™ Release Notes 7

    LIMB 3.2 WHAT’s NEW! 24/06/2015

    Features - New OCR segmentation correction module. - New export output with OCR Statistics and advanced configuration. - Metadata information (EXIF, IPTC, TAGS) can be included into the export step or CSV output. - Manual page numbering can be done with a specific step.

    Improvements - Add a "delete" option for the ICC files. - Add Turkish language to the installer. - New API connection to Limb Gallery.

    Bug Fixes - Fix page numbering issue with ABBYY OCR. - Apply "/" path separator in METS configuration to mix:objectIdentifierValue. - Tag list in the quality control is empty when using the “Hot” folder. - Missing tags in the sampling step when using “Hot” folder. - Multi zone detection process not properly initialized. - Gamma correction process not properly initialized. - Fix METS Validation schema error. - Add multi dongle management and counter to ABBYY11. - Various crashes and bugs were fixed.

    LIMB 3.1 WHAT’s NEW! 30/03/2015

    Features - Update ABBYY v10 with old languages (French, German, Spanish, Italian and Gothic ). Requires a specific

    ABBYY license. - Manage several ABBYY licenses (or hardware dongle). - New API connection to Limb Maestro.

    Improvements - Improve page validation tool for books bigger than 1000 pages. - Improve multi zones detection. Remove the maximum limitation. - Improve error management during export step. - Improve pause mode management for multiple jobs during the OCR step.

    Bug Fixes - Correct the IPTC “Country-Primary Location Code” that needs at least 3 digits. - Correct missing IPTC values in the exported file. - Correct user interface issue with Italian language - Correct missing sliders in the general settings for “Manual Processing Number of Cores” in desktop

    edition. - Various crashes and bugs were fixed

  • February 5, 2018

    LIMB™ Release Notes 8

    LIMB 3.0 WHAT’s NEW! 03/02/2015

    Features - New OCR Correction Module

    - Added a new Multiple Zone Detection tool

    - New CSV advanced export

    - Added support for PDF/A-2B and PDF/A-3B

    - Cores management for OCR and Processing

    Improvements - Improved the metadata panel with support for transformation stylesheets

    - Added additional export format naming options

    - Improved support for custom metadata fields

    - Added the ability to use a custom dictionary with ABBYY OCR engines

    - Improvements to batch processing

    - Improvements to page detection

    - Use jHove for exporting JPEG and TIFF Mix files

    - Improved resize methods

    - Improved global memory management

    Bug Fixes - Fixed bug related to missing ALTO files

    - Various crashes and bugs were fixed

    LIMB 2.3.1 WHAT’S NEW! 04/09/2014

    Bug Fixes & Improvements

    - Improvements to Image Processing - Improved OCR Scheduler behavior - Fixed bug related to resizing color images in a PDF - Fixed error where some users could not retrieve Marc records - Fixed various bugs related to IPTC and CSV metadata - Fixed memory leak when processing 48bit images - Fixed bug when manually starting the OCR step that was paused by the OCR Scheduler - Various crashes and other bugs were fixed

  • February 5, 2018

    LIMB™ Release Notes 9

    LIMB 2.3 WHAT’S NEW! 25/06/2014

    Features - Optimized Working Folder – The working folder with temporary files is now optimized to minimize its

    size and increase processing speed. In the general settings of the application you can choose between 3 modes

    1. Non-destructive: all temporary files are saved as TIFFS 2. Optimized: all temporary files are saved as JPEG (compression ration can be set) 3. Automatic (default mode): temporary files are saved as JPGs if the input format is a lossy

    format such as a JPG or the temporary files are saved as TIFFs if the input format is a lossless format.

    - EXIF/IPTC Metadata Handling - Added new feature to embed metadata in the header of the images (Exif & IPTC fields). Metadata can be set manually or dynamically using the descriptive metadata

    - ICC Profile Management - New Export option to attach ICC profile to an image - LIMB API – New documentation available for the API of LIMB. Those API enable to control LIMB from a

    third party tool. http://www.i2s-cloud.com/public.php?service=files&t=1310379170fac6ba239f4ec63035fba8

    - Export Path Settings – New settings for the export path : it now possible to set a path for each format individually allowing to export one format to a sever and another format to another server as an example

    - Page Border Deskew – New deskew option to deskew based on page borders in addition to the current deskew method based on text lines

    - Remove Borders after Deskew – New option to remove borders after deskew

    Bug Fixes & Improvements

    - Improvements in page detection algorythm : detection of pages on clear background and general improvements

    - Ability to rename images during QC - Corrected bugs related to the LIMB Server Client/Server mode - Fixed black and gray borders sometimes seen after apply curvature correction to the image - Fixed various crashes

    http://www.i2s-cloud.com/public.php?service=files&t=1310379170fac6ba239f4ec63035fba8http://www.i2s-cloud.com/public.php?service=files&t=1310379170fac6ba239f4ec63035fba8

  • February 5, 2018

    LIMB™ Release Notes 10

    LIMB 2.2.2 WHAT’S NEW!

    Bug Fixes & Improvements

    - Improved ABBYY11 processing and consistency - Fixed crashes sometimes seen when opening and closing the application - Fixed Auto upgrade button - Fixed auto deskew not updating the large image in the Quality Control UI - Fixed DPI issues when Curvature correction tool was applied - Add warning to Automatic Pagination tool if OCR engine is not detected - Fixed manual metadata entry problem and improved import metadata behavior with various fixes - Fixed error where OCR engine displayed in list even if the engine was not available - Various crashes and bugs

    LIMB 2.2.1 WHAT’S NEW! 18/03/2014

    Features - Automatic Color Mode – Added new Automatic Color Mode which automatically sets the color depth

    for the image using a histogram. If the image contains color or grayscale then the image is converted

    accordingly. If there is no color or gray level then it is reduced to black and white

    - PDF Table of Contents Highlight - Added option to PDF export to hyperlink the table of contents in the

    PDF output

    - Auto Table of Contents Zone Editing - Auto Table of Contents now supports manually editing the

    highlight zone for the PDF

    - Image Side Tagging - Added support for tagging the image side (left/right) in manual Quality Control

    - OCR QA Image and Text Review - New text and image highlight in OCR QA based on the OCR output

    - Advanced PDF Export - Added advanced PDF output options to export the PDF with different embedded

    image formats and settings based on the bit depth of the input file

    - LIMB Server Feature - OCR can now be executed on a separate, dedicated server

    Bug Fixes & Improvements - Multiple Manual User Interfaces can now be opened at the same time (i.e. manual quality control,

    structuring)

    - Added option to METS output for automatically validating the METS schema

    - Improved manual deskew tool

    - Improved advanced configuration

    - Added warning when importing a text only PDF (currently not supported)

    - Images with embedded ICC profiles caused image processing to fail

    - Fixed export error for books that were sent back to QC from the sampling step

    - Accent marks not supported in watermarking

    - Cannot export Word document if the images are processed or tagged at 96 DPI or less

    - Test connection button for Marc record server retrieval does not show any results

    - Fixed ALTO validation errors

    - Crashes and errors related to the image processing list in Quality Control

    - Various crashes

  • February 5, 2018

    LIMB™ Release Notes 11

    LIMB 2.1.1 WHAT’S NEW!

    Features - MIX Export – Generate MIX files linked to a specific image export format

    Bug Fixes & Improvements - Corrected METS file validation errors

    - Fixed memory leak in Automatic Quality Control

    - Minor Padding tool bug fix

    LIMB 2.1.0 WHAT’S NEW! 31/10/2013

    Features - Metadata Import – New metadata import interface. Includes automatic metadata retrieval from Z39.50

    and OAI servers, CSV matcher, retrieval from a master CSV file and verification for the presence of

    metadata in the job folder

    - ALTO Generation – Generate ALTO files linked to an export format

    - BNF ALTO – Added support for BNF ALTO output

    - METS Improvements – Improvements to the METS output and added flexibility

    Bug Fixes & Improvements - Fix manual split function in Quality Control interface

    - Fixed TIFF G4 images appearing inverted in some viewers

    - Fixed some images appearing inverted after a image processing tool was applied

    - MODS, MARC and DC model improvements and bug fixes

    - Allow two key fields to be added to the MODS, MARC, DC and CSV outputs

    - PDF title now shows in title bar instead of "Unknown"

    - Padding tool now allows the user to input by pixels, mm or inches. Other Padding Tool bug fixes

    - Delete from finish queue and dashboard now deletes working folder from file system

    - Added detection of page slope options to detect page region

    - Resize image tool improvements

    - Light correction tool improvements

    - Page Validation tool thumbnails now includes the original OCR'd number in a light blue box

    - Licensing improvements and bug fixes

    - Fixed reprocessing a book from finished queue and deleting images resulted in export error

    - Removed check that forced user to select OCR for outputs that required OCR

    - Various crashes fixed

  • February 5, 2018

    LIMB™ Release Notes 12

    LIMB 2.0.1 WHAT’S NEW!

    Features - ABBYY 10 OCR Engine – Added compatibility with ABBYY 10 OCR Engine

    Bug Fixes & Improvements - Minor bug fixes

    LIMB 2.0 WHAT’S NEW! 13/06/2013

    Features - Hot folder Monitoring - New queue for monitoring scanned books that will be automatically added to

    the queue with a pre-determined workflow.

    - Mixed Export mode - Exports one set of images with different customizable image formats in the same

    output folder.

    - Document validation - New step for verifying all the pages of the document are captured. Feature uses

    OCR engine for page number verification.

    - Support for Word document output

    - Support for XML document output

    - Can now run more than one job simultaneously. Option is available in the settings.

    - Current and Finished Dashboard Queues can now be filtered. Feature can be accessed by hovering

    over the Status section of the dashboard toolbar.

    - Notes - New note function now available on the dashboard and the manual user interfaces.

    Bug Fixes & Improvements

    - Added option to export output inside the input folder

    - QC performance improvements

    - LIMB processing improvements

    - Added previous sample button and keyboard shortcuts to sampling UI

    - CR2 processing improvements

    - Sampling preview improvements

    - Bug fixes in margin detection and center detection

    - Improved PDF import and processing

    - Export processing time improved

    - General LIMB UI changes and enhancements

    - Improved memory management in LIMB

    - Manual QC can now be unchecked separately from other QC options

    - Fixed various crashes reported to LIMB support

  • February 5, 2018

    LIMB™ Release Notes 13

    LIMB 1.2 WHAT’S NEW! 19/03/2013

    Features - YOOLIB™ Integration – Added support for sending documents directly to YOOLIB from the

    LIMB interface. User can choose which categories or collections to send the document to from

    within the LIMB workflow creator.

    - OCR Quality Assurance Interface – Allows the user to see information about the OCR output

    for the job.

    - Document Reprocessing – Jobs can now be reprocessed from the finished queue. Job can be

    sent back to all steps available in LIMB including the template phase.

    - Push Back – Jobs can be pushed back while in the Current queue. Options can be accessed from

    the right-click menu or the side panel.

    - OCR Table of Contents Improvement – User can now select the area that OCR will be applied

    to on the table of contents page. This can be done in the OCR TOC window by simply drawing a

    new box on the table of contents image. User can draw multiple boxes if needed.

    - Option to QC Multiple Jobs at Once – New option to combine all input documents into one job.

    All images for the jobs will appear in the QC window at the same time. The jobs will be

    exported as separate folders.

    - Detection for Incorrect Crop Boxes – A new option is now available under the Quality Control

    Tab in the workflow creator for flagging image that may have bad crop locations. This is only

    available if the automatic page detection tool is used.

    - Input Documents – Added the new option to the workflow creator to input documents. User

    can now select a document directly from a folder.

    Bug Fixes & Improvements

    - Improvements made to the QC interface to make it more responsive.

    - Fixes for timeouts to the licensing server and error of insufficient funds for OCR.

    - Fixed bug in the rescan tool in QC

  • February 5, 2018

    LIMB™ Release Notes 14

    LIMB 1.0.1.1 What's New!

    Bug Fixes - When selection a page from the structure tree for Auto table of Contents, the OCR Engine is not

    initialized properly.

    - Various crashes

    LIMB 1.0.0.0 What's New!

    Features - Added Link to the LIMB YouTube Tutorial page – Videos soon to come -

    Improvements - Improved stability when processing a large set of images.

    - New Icons for the Copy and Delete functions that are available in Left/Right Processing mode under the Image Processing tab.

    - Add a checkbox to activate or deactivate Automatic Quality Control.

    Bug Fixes - When an Image-on-Text PDF is resized the XY coordinates for the word location is incorrect.

    - Added labels to the shortcuts in Structuring.

    - Added more translations for French user interface.

    - New icons for the all button in Quality Control and Structuring