February 5, 2018
LIMB™ Release Notes 1
LIMB™ Processing Release Notes
LIMB 4.3.0 WHAT’s NEW! 06/02/2018
Features
- New Page detection module for better performance.
- New PDF input management of text PDF.
- New PDF input management of multiple layer PDF.
- New Tesseract OCR Engine available.
- Add MAG export format.
- Add Sampling report to the export list.
- Add LibSafe connection to the publishing step.
- Add Samba share protocol to the publishing step.
Improvements
- Updated restart services script from notifier.
- Feedback to support from Limb UI.
Bug Fixes
- Fix curve correction process on 600 dpi images.
- Fix auto table of contents feature when using prefix in page numbering.
- Fix management of specific (Spanish) characters in metadata import from .mrc file.
- Fix bug where bad values from database make the app crash.
- Fix bug where resize process transforms greyscale image to color.
- Fix bug on Gamma correction value modification.
- Fix outside and edge frame margin error with double pages and pixel unit.
- Fix bug where delink between spine and outside edge is not saved.
- Fix bug with date time error at startup in some international languages like German.
- Fix error in XMP values for PDF and JPEG.
- Various crashes and bugs were fixed.
LIMB 4.2.0 WHAT’s NEW! 15/04/2017
Features
- New block ordering management in segmentation module.
- Add a filter block selection based on minimum surface in segmentation module.
- Add an automatic block ordering function in segmentation module.
- Add a manual block ordering function with mouse over in segmentation module.
- New export option to output the same document in RGB, Greyscale and BW.
- Updated ALTO output with ALTO LOC v3.1.
- New automatic quality control for images with low resolution.
- New thumbnail removal tool for imported TIFF files.
- Add QC report and sampling report information to the CSV output.
- New ABBYY v11 package.
- Updated documentation for version 4.2.
Improvements
- Image resize can be done with or without preserving the original aspect ratio.
- Change margin values to decimal for mm and inches.
- Change OCR statistics output from CSV to XLSX.
- Image padding process can be done with %.
- Manual pagination improvement.
- Improve processing time for ABBYY v11.
- Add new dongle Codemeter drivers to ABBYY v11 package.
- Improve process management on Left/Right page status switch in QC.
- Improve connection settings between Limb Processing and Limb Gallery.
- Change default thumbnail size in UI for better selection.
- Change process list box size in UI for better display.
Bug Fixes
- Fix the crash on erasing the last available frame.
- Fix the manual custom fields modification that was not persistent.
- Fix OCR statistics output error.
- Fix the resolution information after image resize in %.
- Fix user rights management on Server version.
- Fix the export of image custom fields metadata.
- Fix the simultaneous export of Raw XML and CSV.
- Fix the default resolution information read for JP2K files (72 dpi -> 300 dpi).
- Fix the mixed import of multiple page and single page files.
- Various crashes and bugs were fixed.
LIMB 4.1.0 WHAT’s NEW! 15/12/2016
Features
- Advanced manual selection process.
- New connection to metadata server through HTTP.
- Page detection process can be done on custom color background based on HSL values.
Improvements
- Multicore management on batch processing and OCR.
- JP2K output is now based on OpenJpeg library.
- Add image list defects to the QC report export format.
- Default method for resize process is now “Cubic”.
Bug Fixes
- Fix JP2K export slowness with greyscale images.
- Fix incorrect position of ":" in OCR results for Arabic language.
- Fix ICC profile usage that decreases image resolution to 72 dpi.
- Fix automatic page detection where right pages are detected as left pages.
- Fix copy and paste in custom fields that cuts off text at 255 characters.
- Fix curvature correction that is not applied on image due to missing VC redist in the installer.
- Fix parallel jobs during OCR step that do not reach the maximum number of cores.
- Fix issue with publishing TOC files to Gallery that transfers 2 TOC files.
- Fix issue with mixed output that does not export TIFF G4 due to temporary files.
- Fix image conversion to B&W that keeps image depth at 8 bits.
- Fix issue with multiple PDF import and mixed page order.
- Fix PDF export error when processing large batches.
- Fix missing image resolution with JP2K output.
- Various crashes and bugs were fixed.
LIMB 4.0.0 WHAT’s NEW! 30/06/2016
Features
- Civil Register Module (as an option).
- Advanced Custom Fields (as an option).
- Metadata entry window construction based on an XML file.
- New XML and HTML output for document and image custom fields.
- Automatic structure completion based on image tags.
- Improve connection between Limb Processing and Limb Gallery:
  - Add "Publish" option.
  - Choose Limb Gallery Template, language and item type to import metadata.
  - Choose several formats to publish.
- New Raw XML output for document metadata.
- Apply XSLT stylesheet to Raw XML output.
Improvements
- Full screen mode window in QC step.
- Add a shortcut (CTRL+D) for the erase inside process.
- Add information about all selected languages in the OCR settings step.
- Add option to TOC file to include or omit the physical page number.
- METS output:
  - Add SHA-1, CRC32, MD5 checksum in METS.
  - Add tech MD on derivatives in METS.
  - Add JP2 creation date in METS.
- Add error message when OCR counter is empty.
- Select between two ABBYY dongles.
- Client lite installer for the server version.
- Center process has new option to disable detection.
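For readers who want to verify the checksum types named for the METS output (SHA-1, CRC32, MD5) outside of LIMB, here is a minimal Python sketch. It is not LIMB's implementation: the function name `file_checksums`, the chunk size and the returned key names are assumptions made for illustration, and it uses only the Python standard library.

```python
import hashlib
import zlib


def file_checksums(path, chunk_size=1 << 20):
    """Return SHA-1, MD5 and CRC32 of a file as lowercase hex strings.

    Hypothetical helper for illustration only; not part of LIMB.
    The file is read in chunks so large images do not need to fit in memory.
    """
    sha1 = hashlib.sha1()
    md5 = hashlib.md5()
    crc = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            sha1.update(chunk)
            md5.update(chunk)
            crc = zlib.crc32(chunk, crc)  # running CRC over all chunks
    return {
        "SHA-1": sha1.hexdigest(),
        "MD5": md5.hexdigest(),
        "CRC32": format(crc & 0xFFFFFFFF, "08x"),
    }
```

Comparing these values with the ones recorded in an exported METS file is one way to check that a derivative has not changed since export, assuming the METS file stores the checksums in hexadecimal form.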
Bug Fixes
- Fix some auto pagination crashes in page validation step.
- Fix some auto pagination issues with large books over 1000 pages that do not give results.
- Fix translation for Segmentation module buttons.
- Fix inverted word display in OCR QC for Arabic document with IRIS.
- Fix PDF import into Hot Folder that does not start.
- Fix issue with multiple single page PDF files during import step.
- Fix search attribute from the import catalog method that is not saved.
- Fix ALTO file extension that is not correct in publishing step (Limb Gallery).
- Fix missing for JP2 in METS.
- Fix segmentation module crash with specific document.
- Fix OCR User Dictionary that is not saved with the template.
- Fix Segmentation module low resolution display.
- Fix manual pagination window crash if the user removes the step value.
- Fix error with Working Folder temporary directory creation.
- Fix "filename" value from ALTO file that is different from image filename.
- Fix the issue with only 4 languages available with ABBYY OCR on client computer with server version.
- Fix the crash when project is empty and adding an image in the QC step.
- Fix synchronization issue with processed image display after some QC processes.
- Various crashes and bugs were fixed.
LIMB 3.3.2 WHAT’s NEW! 15/01/2016
Features
- Automatically extract Bookmarks from imported PDF and retrieve the information into the structuring.
- Customization of folio pagination. Added prefix and suffix for left and right pages.
- Added customization of the search attribute for the Z39.50 protocol.
Improvements
- Added full image resolution in structuring step viewer.
- Page detection process improvement for dark images.
- Improvement of ALTO output in order to pass XSD validation.
Bug Fixes
- Fix the issue with two exported METS files in different locations.
- Fix missing XML header for exported DC file.
- Fix error in MRC entries with some fields.
- Fix the crash issue that creates a never-ending process.
- Various crashes and bugs were fixed.
LIMB 3.3.1 WHAT’s NEW! 15/10/2015
Improvements
- Manage case sensitive metadata server on OAI protocol.
- Modify software version in ALTO file.
- Added full image resolution in document validation viewer.
Bug Fixes
- Fix the issue with only 5 languages available in ABBYY OCR.
- Fix the Segmentation module restricted access for desktop users.
- Various crashes and bugs were fixed.
LIMB 3.3 WHAT’s NEW! 21/09/2015
Features
- New custom folio pagination available in the page validation tool.
- Physical page number has been added to the table of contents (TOC) output.
- Import step can manage automatic metadata with CSV and manual entries at the same time.
Improvements
- OCR engine name has been added to the OCR statistics output.
- New keyboard shortcuts for segmentation module to add text zone.
- Import step detects 0 KB files to improve processing speed.
- Spanish characters display has been improved in the MARC record automatic retrieval form.
- Added .mrc format to automatic imported metadata files.
Bug Fixes
- ABBYY OCR available languages are updated from the license and not from the template.
- Fix bug where project with the same name will not go into the segmentation module.
- Fix segmentation shortcuts not available.
- Fix bug with missing text entry in METS file output.
- Various crashes and bugs were fixed.
LIMB 3.2 WHAT’s NEW! 24/06/2015
Features
- New OCR segmentation correction module.
- New export output with OCR Statistics and advanced configuration.
- Metadata information (EXIF, IPTC, TAGS) can be included in the export step or CSV output.
- Manual page numbering can be done with a specific step.
Improvements
- Add a "delete" option for the ICC files.
- Add Turkish language to the installer.
- New API connection to Limb Gallery.
Bug Fixes
- Fix page numbering issue with ABBYY OCR.
- Apply "/" path separator in METS configuration to mix:objectIdentifierValue.
- Tag list in the quality control is empty when using the “Hot” folder.
- Missing tags in the sampling step when using “Hot” folder.
- Multi zone detection process not properly initialized.
- Gamma correction process not properly initialized.
- Fix METS Validation schema error.
- Add multi dongle management and counter to ABBYY11.
- Various crashes and bugs were fixed.
LIMB 3.1 WHAT’s NEW! 30/03/2015
Features
- Update ABBYY v10 with old languages (French, German, Spanish, Italian and Gothic). Requires a specific ABBYY license.
- Manage several ABBYY licenses (or hardware dongle).
- New API connection to Limb Maestro.
Improvements
- Improve page validation tool for books bigger than 1000 pages.
- Improve multi zones detection. Remove the maximum limitation.
- Improve error management during export step.
- Improve pause mode management for multiple jobs during the OCR step.
Bug Fixes
- Correct the IPTC “Country-Primary Location Code” that needs at least 3 digits.
- Correct missing IPTC values in the exported file.
- Correct user interface issue with Italian language.
- Correct missing sliders in the general settings for “Manual Processing Number of Cores” in desktop edition.
- Various crashes and bugs were fixed.
LIMB 3.0 WHAT’s NEW! 03/02/2015
Features
- New OCR Correction Module
- Added a new Multiple Zone Detection tool
- New CSV advanced export
- Added support for PDF/A-2B and PDF/A-3B
- Cores management for OCR and Processing
Improvements
- Improved the metadata panel with support for transformation stylesheets
- Added additional export format naming options
- Improved support for custom metadata fields
- Added the ability to use a custom dictionary with ABBYY OCR engines
- Improvements to batch processing
- Improvements to page detection
- Use jHove for exporting JPEG and TIFF Mix files
- Improved resize methods
- Improved global memory management
Bug Fixes
- Fixed bug related to missing ALTO files
- Various crashes and bugs were fixed
LIMB 2.3.1 WHAT’S NEW! 04/09/2014
Bug Fixes & Improvements
- Improvements to Image Processing
- Improved OCR Scheduler behavior
- Fixed bug related to resizing color images in a PDF
- Fixed error where some users could not retrieve Marc records
- Fixed various bugs related to IPTC and CSV metadata
- Fixed memory leak when processing 48bit images
- Fixed bug when manually starting the OCR step that was paused by the OCR Scheduler
- Various crashes and other bugs were fixed
LIMB 2.3 WHAT’S NEW! 25/06/2014
Features
- Optimized Working Folder – The working folder with temporary files is now optimized to minimize its size and increase processing speed. In the general settings of the application you can choose between 3 modes:
  1. Non-destructive: all temporary files are saved as TIFFs
  2. Optimized: all temporary files are saved as JPEG (compression ratio can be set)
  3. Automatic (default mode): temporary files are saved as JPEGs if the input format is a lossy format such as JPEG, or as TIFFs if the input format is a lossless format
- EXIF/IPTC Metadata Handling - Added new feature to embed metadata in the header of the images (Exif & IPTC fields). Metadata can be set manually or dynamically using the descriptive metadata
- ICC Profile Management - New Export option to attach ICC profile to an image
- LIMB API – New documentation available for the LIMB API. The API makes it possible to control LIMB from a third-party tool. http://www.i2s-cloud.com/public.php?service=files&t=1310379170fac6ba239f4ec63035fba8
- Export Path Settings – New settings for the export path: it is now possible to set a path for each format individually, so that, for example, one format can be exported to one server and another format to a different server
- Page Border Deskew – New deskew option to deskew based on page borders in addition to the current deskew method based on text lines
- Remove Borders after Deskew – New option to remove borders after deskew
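The three working folder modes described under Optimized Working Folder amount to a small decision rule, which can be sketched in Python as follows. This is an illustration of the rule as stated in these notes, not LIMB's code: the function name `temp_file_format`, the mode strings and the set of extensions treated as lossy are all assumptions.

```python
from pathlib import Path

# Assumption: only JPEG inputs are treated as lossy in this sketch.
# The notes say "a lossy format such as a JPG" without listing others.
LOSSY_EXTENSIONS = {".jpg", ".jpeg"}


def temp_file_format(input_path, mode="automatic"):
    """Pick the temporary file format for one input image.

    Hypothetical helper; mode names mirror the three modes in the notes.
    """
    if mode == "non-destructive":
        return "TIFF"  # always lossless temporary files
    if mode == "optimized":
        return "JPEG"  # always compressed temporary files
    if mode == "automatic":
        # Lossy input stays lossy, lossless input stays lossless.
        suffix = Path(input_path).suffix.lower()
        return "JPEG" if suffix in LOSSY_EXTENSIONS else "TIFF"
    raise ValueError(f"unknown working folder mode: {mode!r}")
```

For example, `temp_file_format("scan_0001.tif")` returns `"TIFF"` in the default automatic mode, while the same call with `mode="optimized"` returns `"JPEG"`.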
Bug Fixes & Improvements
- Improvements in page detection algorithm: detection of pages on clear background and general improvements
- Ability to rename images during QC
- Corrected bugs related to the LIMB Server Client/Server mode
- Fixed black and gray borders sometimes seen after applying curvature correction to the image
- Fixed various crashes
LIMB 2.2.2 WHAT’S NEW!
Bug Fixes & Improvements
- Improved ABBYY11 processing and consistency
- Fixed crashes sometimes seen when opening and closing the application
- Fixed Auto upgrade button
- Fixed auto deskew not updating the large image in the Quality Control UI
- Fixed DPI issues when Curvature correction tool was applied
- Add warning to Automatic Pagination tool if OCR engine is not detected
- Fixed manual metadata entry problem and improved import metadata behavior with various fixes
- Fixed error where OCR engine displayed in list even if the engine was not available
- Various crashes and bugs
LIMB 2.2.1 WHAT’S NEW! 18/03/2014
Features
- Automatic Color Mode – Added new Automatic Color Mode which automatically sets the color depth for the image using a histogram. If the image contains color or grayscale then the image is converted accordingly. If there is no color or gray level then it is reduced to black and white
- PDF Table of Contents Highlight - Added option to PDF export to hyperlink the table of contents in the PDF output
- Auto Table of Contents Zone Editing - Auto Table of Contents now supports manually editing the highlight zone for the PDF
- Image Side Tagging - Added support for tagging the image side (left/right) in manual Quality Control
- OCR QA Image and Text Review - New text and image highlight in OCR QA based on the OCR output
- Advanced PDF Export - Added advanced PDF output options to export the PDF with different embedded image formats and settings based on the bit depth of the input file
- LIMB Server Feature - OCR can now be executed on a separate, dedicated server
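The Automatic Color Mode feature above is described only as a histogram-based choice between color, grayscale and black and white. The Python sketch below shows one way such a decision could work on a list of RGB pixels. It is an assumption-laden illustration, not the algorithm LIMB uses: the function name `classify_color_mode` and the thresholds `channel_tol`, `bw_margin` and `min_fraction` are invented for this example.

```python
def classify_color_mode(pixels, channel_tol=8, bw_margin=32, min_fraction=0.01):
    """Classify an image as "color", "grayscale" or "bw".

    pixels: iterable of (r, g, b) tuples with values 0..255.
    All thresholds are illustrative assumptions, not LIMB settings.
    """
    pixels = list(pixels)
    if not pixels:
        raise ValueError("no pixels given")
    total = len(pixels)

    # A pixel counts as colored when its channels differ noticeably.
    colored = sum(1 for r, g, b in pixels if max(r, g, b) - min(r, g, b) > channel_tol)
    if colored / total > min_fraction:
        return "color"

    # Build a 256-bin histogram of gray levels.
    histogram = [0] * 256
    for r, g, b in pixels:
        histogram[(r + g + b) // 3] += 1

    # Mid-tones are levels that are neither near black nor near white.
    mid_tones = sum(histogram[bw_margin:256 - bw_margin])
    if mid_tones / total > min_fraction:
        return "grayscale"
    return "bw"
```

With these assumed thresholds, a page made only of pure black and pure white pixels is classified as `"bw"`, a uniformly mid-gray page as `"grayscale"`, and a page with a visible share of saturated pixels as `"color"`.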
Bug Fixes & Improvements
- Multiple Manual User Interfaces can now be opened at the same time (i.e. manual quality control, structuring)
- Added option to METS output for automatically validating the METS schema
- Improved manual deskew tool
- Improved advanced configuration
- Added warning when importing a text only PDF (currently not supported)
- Images with embedded ICC profiles caused image processing to fail
- Fixed export error for books that were sent back to QC from the sampling step
- Accent marks not supported in watermarking
- Cannot export Word document if the images are processed or tagged at 96 DPI or less
- Test connection button for Marc record server retrieval does not show any results
- Fixed ALTO validation errors
- Crashes and errors related to the image processing list in Quality Control
- Various crashes
LIMB 2.1.1 WHAT’S NEW!
Features
- MIX Export – Generate MIX files linked to a specific image export format
Bug Fixes & Improvements
- Corrected METS file validation errors
- Fixed memory leak in Automatic Quality Control
- Minor Padding tool bug fix
LIMB 2.1.0 WHAT’S NEW! 31/10/2013
Features
- Metadata Import – New metadata import interface. Includes automatic metadata retrieval from Z39.50 and OAI servers, CSV matcher, retrieval from a master CSV file and verification for the presence of metadata in the job folder
- ALTO Generation – Generate ALTO files linked to an export format
- BNF ALTO – Added support for BNF ALTO output
- METS Improvements – Improvements to the METS output and added flexibility
Bug Fixes & Improvements
- Fix manual split function in Quality Control interface
- Fixed TIFF G4 images appearing inverted in some viewers
- Fixed some images appearing inverted after an image processing tool was applied
- MODS, MARC and DC model improvements and bug fixes
- Allow two key fields to be added to the MODS, MARC, DC and CSV outputs
- PDF title now shows in title bar instead of "Unknown"
- Padding tool now allows the user to input by pixels, mm or inches. Other Padding Tool bug fixes
- Delete from finish queue and dashboard now deletes working folder from file system
- Added detection of page slope options to detect page region
- Resize image tool improvements
- Light correction tool improvements
- Page Validation tool thumbnails now includes the original OCR'd number in a light blue box
- Licensing improvements and bug fixes
- Fixed reprocessing a book from finished queue and deleting images resulted in export error
- Removed check that forced user to select OCR for outputs that required OCR
- Various crashes fixed
LIMB 2.0.1 WHAT’S NEW!
Features
- ABBYY 10 OCR Engine – Added compatibility with ABBYY 10 OCR Engine
Bug Fixes & Improvements
- Minor bug fixes
LIMB 2.0 WHAT’S NEW! 13/06/2013
Features
- Hot folder Monitoring - New queue for monitoring scanned books that will be automatically added to the queue with a pre-determined workflow.
- Mixed Export mode - Exports one set of images with different customizable image formats in the same output folder.
- Document validation - New step for verifying all the pages of the document are captured. Feature uses OCR engine for page number verification.
- Support for Word document output
- Support for XML document output
- Can now run more than one job simultaneously. Option is available in the settings.
- Current and Finished Dashboard Queues can now be filtered. Feature can be accessed by hovering over the Status section of the dashboard toolbar.
- Notes - New note function now available on the dashboard and the manual user interfaces.
Bug Fixes & Improvements
- Added option to export output inside the input folder
- QC performance improvements
- LIMB processing improvements
- Added previous sample button and keyboard shortcuts to sampling UI
- CR2 processing improvements
- Sampling preview improvements
- Bug fixes in margin detection and center detection
- Improved PDF import and processing
- Export processing time improved
- General LIMB UI changes and enhancements
- Improved memory management in LIMB
- Manual QC can now be unchecked separately from other QC options
- Fixed various crashes reported to LIMB support
LIMB 1.2 WHAT’S NEW! 19/03/2013
Features
- YOOLIB™ Integration – Added support for sending documents directly to YOOLIB from the LIMB interface. User can choose which categories or collections to send the document to from within the LIMB workflow creator.
- OCR Quality Assurance Interface – Allows the user to see information about the OCR output for the job.
- Document Reprocessing – Jobs can now be reprocessed from the finished queue. Job can be sent back to all steps available in LIMB including the template phase.
- Push Back – Jobs can be pushed back while in the Current queue. Options can be accessed from the right-click menu or the side panel.
- OCR Table of Contents Improvement – User can now select the area that OCR will be applied to on the table of contents page. This can be done in the OCR TOC window by simply drawing a new box on the table of contents image. User can draw multiple boxes if needed.
- Option to QC Multiple Jobs at Once – New option to combine all input documents into one job. All images for the jobs will appear in the QC window at the same time. The jobs will be exported as separate folders.
- Detection for Incorrect Crop Boxes – A new option is now available under the Quality Control Tab in the workflow creator for flagging images that may have bad crop locations. This is only available if the automatic page detection tool is used.
- Input Documents – Added a new option to the workflow creator to input documents. User can now select a document directly from a folder.
Bug Fixes & Improvements
- Improvements made to the QC interface to make it more responsive.
- Fixes for timeouts to the licensing server and error of insufficient funds for OCR.
- Fixed bug in the rescan tool in QC
LIMB 1.0.1.1 What's New!
Bug Fixes
- When selecting a page from the structure tree for Auto Table of Contents, the OCR Engine is not initialized properly.
- Various crashes
LIMB 1.0.0.0 What's New!
Features
- Added Link to the LIMB YouTube Tutorial page – Videos soon to come
Improvements
- Improved stability when processing a large set of images.
- New Icons for the Copy and Delete functions that are available in Left/Right Processing mode under the Image Processing tab.
- Add a checkbox to activate or deactivate Automatic Quality Control.
Bug Fixes
- When an Image-on-Text PDF is resized, the XY coordinates for the word location are incorrect.
- Added labels to the shortcuts in Structuring.
- Added more translations for French user interface.
- New icons for the all button in Quality Control and Structuring