Best file format for scanning documents


















PDF files are the best option for text documents, shapes, or different images containing text. A few programs, similar to Adobe Acrobat, utilize OCR innovation to scan the real characters of the text so you can edit the text later and search for words or expressions in the document. Since the PDF format incorporates programmed image pressure, it is a good decision for scanned images too.

PNG was created later than the others, with the intention of sidestepping LZW pressure patent issues with GIF, and since it is more current, it offers different choices as well RGB shading modes, 16 bits, and so on.

That patent circumstance has left now, however PNG stays great lossless pressure. When tif files are processed many software packages give better results than when pdf files are used.

In the end the choice of format depends upon how you are going to use the files, what protections you have built into your system and the reliablity you can achieve with any format choice. Testing to be sure you get what you want is always a good idea and a well designed process will be the result. In most circumstances we find that users are happy with pdf and the system can be created to manage the risks associated with data manipulation if that is of concern.

Setting this type of system up takes good knowledge of the ins and outs of document management , something that comes with experience and working through many installations.

Topics: Scanning , document filing , document storage strategy , software tools , document management. This blog discusses ways to optimize and improve the management and production of documents used in your business.

You can scan multi-page documents as a single page with TIFF. Most people find it simpler to just stick to PDF to handle their scanning needs. Your email address will not be published.

Save my name, email, and website in this browser for the next time I comment. Multi-language Translation System. Export Information. Responsible Disclosure Policy. It can do this in a variety of methods, such as combining similarly colored pixels into a single color to reduce the amount of data stored. Historically, lossless compression would result in a larger file size when compared to a file using lossy compression. However, with new compression algorithms, lossless compressed files are now manageable in size and preserve data better than ever.

An uncompressed image is one in which no file level or image level compression has been applied. While common in the digital photography world, uncompressed file formats are typically not used in archiving content from paper, film or microfilm content. The resulting file size would create major roadblocks in terms of file storage and sharing of information. Multiply that over a collection of millions of records and the technical requirements for this become unwieldy.

File Formats and Commonly Supported Compression The chart below outlines the compression methods typically used in document scanning applications. While the file formats listed below may support additional formats, it is uncommon in scanning applications and long-term archiving.

When assessing a records collection, one of the factors to consider is the generation level of the content. It should come as no surprise that the further removed from the original document, through photocopying, scanning, reprinting, etc. A common example of this is paper record or drawing that has been frequently handled over time.

The document accumulates dirt around the corners from normal handling and once photocopied, those muddied areas become black and almost unreadable. The same is true, and often magnified, for legacy media such as microfilm or microfiche when they are duplicated. Duplicate copies often do not have the same density levels as the originals and frequently have scratches that have developed through normal use of the media.

Many copies turn out almost distinguishable from the original records. If the contrast between documents varies greatly throughout a collection, you may want to consider capturing the content in greyscale mode to pick-up more detail.

Used on documents and drawings with only minor deviations in contrast throughout the collection. Bitonal is well suited for documents where there is no importance to the colors on the documents i. Bitonal is not suited well for poor contrast documents i. Greyscale is recommended for files which would suffer information loss if scanned in bitonal mode. For example, poor contrast documents, photographs where the color is not critical , documents with a wide-range of variation, microfilm and microfiche.

Color mode will typically not return more detail then greyscale on poor contrast files and is only recommended when color has a significant relevance to the information on the document. Color scanning results in a much larger digital image, which can make it difficult to share images due to slow transfer speeds and can create network storage problems.

Another option to consider to reduce file size is the use of automatic color detection.



0コメント

  • 1000 / 1000