Scan, Import, and Automatically File documents to SkyDrive with ccScan

15
Capture Components, LLC White Paper Page 1 of 15 Copyright © 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. Scanning to SkyDrive with ccScan Document Capture to the Cloud Overview This white paper shows how ccScan, a production-quality document capture application, can be used to capture documents to SkyDrive in an automated, time-saving manner. Microsoft SkyDrive is a free hosting service that allows users to upload files to cloud storage and then access them from a Web browser or smart phone app. SkyDrive is part of the Windows Live series of online services and allows users to keep their files private, share them, or make them public. Use free companions to Microsoft Word, Excel, PowerPoint, and OneNote to view, edit, share, and work together with anyone on documents stored on SkyDrive. Since SkyDrive saves different versions automatically older versions of online documents remain available. Since there is a lot more to SkyDrive than this short introduction can cover please visit the SkyDrive website at http://windows.microsoft.com/en-us/skydrive/home for in-depth formation. Although SkyDrive customers can easily upload their documents if they are already in digital format, paper documents must be scanned first and then manually uploaded. ccScan eliminates the time typically wasted with such a multi-step, manual operation that consists of scanning a document locally to the PC, naming this document, creating a folder in SkyDrive, uploading the document to SkyDrive, and finally setting document properties such as description. Instead ccScan does all of the above in a single-step operation that is fully automated. Naming of documents and folders can be manual or automated by using barcodes or running counters. In addition to scanning paper documents, ccScan has bulk import capabilities for already digitized documents that allow entire folders to be uploaded to SkyDrive in a single step using wild card file specifications and recursion. 32158 Camino Capistrano Suite A PMB 373 San Juan Capistrano, CA 92675 [email protected] www.CaptureComponents.com

description

ccScan Advanced for SkyDrive is best explained with some examples: run unattended jobs to import electronic faxes, upload them to automatically selected SkyDrive folders, and automatically name the document, all based on information extracted from barcodes and text pattern searches in text obtained through OCR. Or scan documents and automatically create document names and descriptions. Since ccScan is highly configurable it can be applied to many scenarios where opportunities for automation and large time savings are present. ccScan Advanced features are based upon sophisticated technologies such as barcode detection, OCR, and Text Pattern search with Regular Expressions. These capabilities are applied to both the scanning of paper documents in the paper-based office and the processing of electronic documents in the paperless office. ccScan Standard for SkyDrive is similar to ccScan Advanced without the automation capabilities. ccScan Standard is typically used in a paper-based office to efficiently scan paper documents to SkyDrive. Use ccScan Standard to eliminate wasted time and the following extra steps: scan a document to your PC using any Twain scanner, name the document, create a folder in SkyDrive, upload the document to SkyDrive, and finally setting up SkyDrive document properties such as Description, Visibility, and Sharing mode. Instead, ccScan Standard does all of the above in a single-step operation.

Transcript of Scan, Import, and Automatically File documents to SkyDrive with ccScan

  • 1. Capture Components, LLC White PaperPage 1 of 1532158 Camino CapistranoSuite A PMB 373San Juan Capistrano, CA 92675Sales@CaptureComponents.comwww.CaptureComponents.comScanning to SkyDrive with ccScanDocument Capture to the CloudOverviewThis white paper shows how ccScan, a production-quality document capture application,can be used to capture documents to SkyDrive in an automated, time-saving manner.Microsoft SkyDrive is a free hosting service that allows users to upload files to cloudstorage and then access them from a Web browser or smart phone app. SkyDrive is part ofthe Windows Live series of online services and allows users to keep their files private,share them, or make them public. Use free companions to Microsoft Word, Excel,PowerPoint, and OneNote to view, edit, share, and work together with anyone ondocuments stored on SkyDrive. Since SkyDrive saves different versions automatically olderversions of online documents remain available. Since there is a lot more to SkyDrive thanthis short introduction can cover please visit the SkyDrive website athttp://windows.microsoft.com/en-us/skydrive/home for in-depth formation.Although SkyDrive customers can easily upload their documents if they are already indigital format, paper documents must be scanned first and then manually uploaded. ccScaneliminates the time typically wasted with such a multi-step, manual operation that consistsof scanning a document locally to the PC, naming this document, creating a folder inSkyDrive, uploading the document to SkyDrive, and finally setting document propertiessuch as description. Instead ccScan does all of the above in a single-step operation that isfully automated. Naming of documents and folders can be manual or automated by usingbarcodes or running counters. In addition to scanning paper documents, ccScan has bulkimport capabilities for already digitized documents that allow entire folders to be uploadedto SkyDrive in a single step using wild card file specifications and recursion.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper.

2. Capture Components, LLC White PaperPage 2 of 15ccScan for SkyDriveCapture Components, LLC markets different versions of ccScan targeted for differentdocument repositories. The ccScan versions for Internet Cloud based storage supportGoogle Drive and Docs, Box, Salesforce, Dropbox, Microsoft SkyDrive, Amazon S3,Windows Azure, etc, but other types of document repositories are supported as well. Forexample, documents can be captured remotely and stored in Microsoft SharePoint andSharePoint Online using SharePoints Internet-based web services interface. AdditionallyccScan works with the Microsoft Windows file system utilizing its local hard drive ornetworked folders for storage.Although ccScan costs significantly less than competing products, it is a full-featured,production-level, document capture system. The software takes full advantage of todaysmulti-core processors. On a quad-core computer the scanning, barcode detection, textrecognition, and export to SkyDrive each run on a separate core to maximize throughput.Consequently most TWAIN-compatible production scanners are operated at rated speed.Since ccScan has been designed to be enterprise-friendly, it contains a Setup mode and aUser mode. The Setup mode is password protected. Jobs are created in Setup mode and canbe used over and over again in User mode. To make sure ccScan can be used immediatelyafter installation, without having to create a job first, a set of default jobs is provided tocover common document capture scenarios.Jobs can be associated with a scanner button, which can make running a job in User modeas simple as pressing a button on the scanner. ccScan also contains an Index/QA mode thatallows the operator to check the document or enter indexing data before the document issent to SkyDrive. Documents are automatically separated if barcode sheets are used.On platforms that contain no native document viewing options such as Amazon S3 andWindows Azure, ccScan provides the capability to create View jobs that will list alldocuments entries or a subset in either a sequential list or hierarchical format. View jobs,however, are not enabled for SkyDrive since SkyDrive natively provides documentviewing capabilities.ccScan uses the SkyDrive REST API to upload documents to SkyDrive. The SkyDriveREST API is also used to create, rename, and delete folders. Although there is no need toknow anything about the API to use ccScan, SkyDrive users interested in learning moreabout this API can visit http://msdn.microsoft.com/en-us/library/live/hh243648.aspx.Despite its low price, ccScan has all the high-end features and no restrictions such as pageclick charges. For detailed information about ccScan or to download a free trial version,please visit Capture Components website at www.capturecomponents.com.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 3. Capture Components, LLC White Paper Page 3 of 15Typical ScenarioTo give the reader a flavor of capturing and storing documents in SkyDrive with ccScanthe following sections contain walkthroughs annotated with screenshots of a typicalscanning and bulk import scenario. Although the default set of jobs installed with ccScancover capturing and storing documents in SkyDrive, the example scenarios assume that noccScan Jobs have been created previously so both Setup and User modes can be shown.For ease of illustration certain details such as the creation of an account with SkyDrive orhow to enter Setup mode have been left out of these descriptive scenarios. Create Scanning JobTo start a new scanning job must be created. In ccScan jobs are created in Setup mode.Once a job has been created, it can be used over and over again in regular User mode. TheAdministrator selects default values for the various job settings and decides which jobsettings can be entered or modified by the operator. The job settings are organized bycategory such as Scanner settings or Export settings.Figure 1 - Create New Scanning JobIn the dialog box above SkyDrive has been selected as the repository for export of thecaptured documents. If ccScan were licensed for multiple repositories, the target repositoryfor the job would be selected here from the list of available repositories. Note that the JobType is Scan, Export.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 4. Capture Components, LLC White Paper Page 4 of 15 Enter Windows Live CredentialsImmediately after the new job has been created, the dialog below comes up to enter theuser name and password for a Windows Live account.Both in User mode and Setup mode the operator must enter an email/username andpassword of an active Windows Live account. When the Login button is pressed ccScanobtains an access token for SkyDrive that is subsequently used to request information fromSkyDrive such as a list of existing folders on SkyDrive. Figure 2 Windows Live Login DialogTo prevent users from having to login each time ccScan is started, the obtained accesstoken is stored in between ccScan sessions in an encrypted file that can only be read byccScan and only on the PC where it was created. At any time the user has the option torevoke the authorization token on the Windows Live website. Note that ccScan can beconfigured to disable the default auto-logon which requires that users re-enter theirWindows Live credentials each time they use ccScan.To allow the administrator flexibility jobs can be configured with a time-out. If the job hasnot been run for e.g. two minutes, the currently logged-in user is automatically logged outand the dialog above will come up when the job is started again.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 5. Capture Components, LLC White PaperPage 5 of 15 Configure New Scanning JobIn the form below the Administrator can configure the new Scanning job. The same form isused to make modifications to the job later. The name of the newly scanned document isentered in Document Name. Figure 3 - Configure New Scanning JobTo allow for automated naming, ccScan supports the use of macros in document andfolder names that automatically insert counter and barcode values. Automated naming isused with high-capacity scanners whose hoppers can hold large stacks of documentsseparated by barcode sheets.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 6. Capture Components, LLC White Paper Page 6 of 15Optionally enter a description of the document in Description. The name of a folder locatedon SkyDrive can be entered in SkyDrive Folder (or selected from the list of existingfolders). When the specified folder does not exist, ccScan will create it. When PreventOverwrite is checked, SkyDrive will reject the uploaded document when a document of thesame name already exists in the selected folder. This option prevents unintentionaldocument overwrites.Job Expiration can be set to the number of seconds of idle time after which the logged-inSkyDrive user is automatically logged out. The default setting of zero disables thisautomatic logout.When Shareable Link is set to a selection other than None, ccScan will request a shareablelink from SkyDrive for the uploaded document. The requested link is returned in the read-only field Document URL that could be copied and pasted. For example, the URL could becopied in an email to share the document with anyone, even non-SkyDrive users. SkyDrivemakes the following selections available for shareable links: read-only URL, read-writeURL, and embedded HTML. The latter option is typically used to request an HTMLfragment containing a link that can be pasted in the HTML of a webpage served from aweb server.This return value is also recorded in the daily log file that ccScan produces to allowautomated post-processing. This JSON format log file contains information about all jobsthat were run on a particular day and the SkyDrive return values are recorded for eachdocument in the job. Typically this capability is used in environments where SkyDrive isused for document storage but links to the documents are stored in a database.For each setting, the Administrator selects or enters a value. Default values are providedwhere possible. The Administrator can also make settings available for the operator toenter, except for the Administrator-only settings. For the settings to be entered by theoperator, the Administrator determines the order in which these settings are shown,optionally modifies the default description of each setting, and selects which type of inputcontrol should be presented to the operator.After the Export Settings, the Scanner Settings are entered. The scanning of documents inccScan is based on the TWAIN standard. The scanner manufacturer provides TWAINdrivers. All the scanner settings you would expect in a production-level document capturesystem are provided: basic settings such as export format, resolution, paper size, andadvanced settings such as barcode detection, which for example, can be used toautomatically separate documents or automatically specify folder names. Jobs can also beassociated with a button at the scanner front panel so that the job is started automaticallywith a press of the button.TWAIN settings highly specific to the type of scanner can optionally be set with theTWAIN dialog box. These settings are saved locally on the computer and associated withthe job. For those settings common between ccScan and the TWAIN dialog box, theccScan settings can be given priority over the TWAIN settings. With this capability theadministrator can give the operator access to the TWAIN dialog box and still, for example,enforce company standards such as 200 DPI resolution.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 7. Capture Components, LLC White PaperPage 7 of 15 Run this Scanning JobOnce the job has been created it can be run over and over again in User mode. The job isstarted either from the scanner front panel or with the Start Job button in ccScan.Figure 4 - Run this Scanning JobTypically jobs are configured so that the operator must enter at least a document nameunless ccScans automated naming capabilities are used. Optionally the operator canchange default scanner settings or enter or select additional SkyDrive fields.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 8. Capture Components, LLC White PaperPage 8 of 15Once the job is started, ccScan disables the Start button and does not allow any changes tothe form. However, the operator is allowed to press the Cancel button. To provide visualfeedback the scanned pages are displayed while the job is running.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 9. Capture Components, LLC White PaperPage 9 of 15 Create Import JobTo start a new Import job it must be created. In ccScan jobs are created in Setup mode.Once a job has been created, it can be used over and over again in regular User mode. TheAdministrator selects default values for the various job settings and decides which jobsettings can be entered or modified by the operator. The job settings are organized bycategory such as Import settings or Export settings.Figure 5 - Create New Import JobIn the dialog box above SkyDrive has been selected as the repository for export of theimported documents. If ccScan was licensed for multiple repositories the target repositoryfor the job would be selected here from the list of available repositories. Note that the JobType is Import, Export. Enter Windows Live CredentialsImmediately after the new job has been created and the Windows Live credentials have notbeen entered yet, the email address and password for a Windows Live account must beentered with the dialog described earlier in this document.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 10. Capture Components, LLC White Paper Page 10 of 15 Configure New Import JobIn the form below the Administrator can configure the new Import job. The same form isused to make modifications to the job later.Figure 6 - Configure New Import JobThe name of the newly imported document is entered in Document Name. If DocumentName is left blank, the new document will get the same name as the imported file (or thename of the first file if multiple files are imported). All the SkyDrive fields mentionedearlier for scanning are available for import as well.For each setting, the Administrator selects or enters a value. Default values are providedwhere possible. The Administrator can also make settings available for the operator toCopyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 11. Capture Components, LLC White Paper Page 11 of 15enter, except for the Administrator-only settings. For the settings to be entered by theoperator, the Administrator determines the order in which these settings are shown,optionally modifies the default description of each setting, and selects what type of inputcontrol should be presented to the operator.After the Export Settings, the Import Settings are entered. First the type of import must beselected, either File Import or Bulk Import. File Import brings up the standard Windowsfile selection dialog and allows the user to navigate to a particular folder and select one ormore files to import.Bulk Import allows the selection of a folder and for ccScan to automatically walk throughthis folder (optionally recursively so all subfolders and their subfolders are included) andimport all files that match one more wildcards (such as *.*, *.pdf, ccitt_?.tif?, etc). Evenwildcards can be specified to exclude certain types of files. Also, for files containingscanned images only, there is an option to combine all files into a single file and importonly this combined file (optionally de-skewing the images first).In Bulk Import Folder either enter the name of the folder to traverse (or select from a list offolders). The wildcard specification to include files is entered in Bulk Inclusion Patternwhile the wildcard specification to exclude files is entered in Bulk Exclusion Pattern. Torecursively traverse subfolders enable Bulk Folder Recursion.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 12. Capture Components, LLC White PaperPage 12 of 15 Run this Import JobOnce the Import job has been created it can be run over and over again in User mode. Thejob is started with the Start Job button. Figure 7 - Run this Import JobTypically jobs are configured so that the operator must enter at least a document nameunless the new document must be named after the imported document. Optionally theCopyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 13. Capture Components, LLC White PaperPage 13 of 15operator can enter or select a SkyDrive folder name where the imported document will bestored.Once the job is started, ccScan disables the Start button and does not allow any changes tothe form. However, the operator is allowed to press the Cancel button. To provide visualfeedback, the imported pages are displayed while the job is running.Copyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 14. Capture Components, LLC White Paper Page 14 of 15 Annotation and RedactionIt is not uncommon to annotate or redact selected paragraphs of documents that are shared.For example, redaction is used routinely in the legal sector. To allow review, correction, orannotation of documents before they are stored in SkyDrive, ccScan provides an optionalIndex/QA mode. This mode can be enabled on a per-job basis by checking the Index/QALater field. When enabled the documents produced with a Scanning or Import job arecached locally and made available in the Index/QA tab of ccScan. Figure 8 - Index/QA ModeCopyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper. 15. Capture Components, LLC White PaperPage 15 of 15Before releasing the documents to SkyDrive from Index/QA, descriptive information canbe added to index fields, and pages can be deleted or inserted. Index information can beentered manually using the keyboard or extracted automatically from selected zones of theimage using OCR (Optical Character Recognition).In addition, pages can be annotated with text and selected portions of the pages could beredacted. Note that the annotations and redactions become permanent once they are savedand cannot be removed later.AfterwordWe at Capture Components LLC hope that this White Paper is of benefit to you. If it is and you areable to conveniently and efficiently capture and import your documents to SkyDrive, then we haveachieved a measure of success. If you choose to use ccScan as your document capture application,we will do our best to provide you with the highest quality product and support.Visit us at http://www.capturecomponents.comCopyright 2012-2013 by Capture Components, LLC. All rights reserved. Capture Components, LLC encourages the reader to freely share and distribute this White Paper.