Gnostice.DocumentStudio.OCR 19.2.1

This package adds OCR capability to Gnostice Document Studio .NET product.
Note that this is an add-on package and therefore should be used along with other packages of Gnostice Document Studio packages.

There is a newer version of this package available.
See the version list below for details.
Install-Package Gnostice.DocumentStudio.OCR -Version 19.2.1
dotnet add package Gnostice.DocumentStudio.OCR --version 19.2.1
<PackageReference Include="Gnostice.DocumentStudio.OCR" Version="19.2.1" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Gnostice.DocumentStudio.OCR --version 19.2.1
The NuGet Team does not provide support for this client. Please contact its maintainers for support.
#r "nuget: Gnostice.DocumentStudio.OCR, 19.2.1"
#r directive can be used in F# Interactive, C# scripting and .NET Interactive. Copy this into the interactive tool or source code of the script to reference the package.
// Install Gnostice.DocumentStudio.OCR as a Cake Addin
#addin nuget:?package=Gnostice.DocumentStudio.OCR&version=19.2.1

// Install Gnostice.DocumentStudio.OCR as a Cake Tool
#tool nuget:?package=Gnostice.DocumentStudio.OCR&version=19.2.1
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Follow this link for the Gnostice Document Studio .NET Developer Guide.

Follow this link for the Gnostice Document Studio .NET Developer Guide.

Release Notes

Version 2019 R2 - September 23, 2019
====================================
Introduced
~~~~~~~~~~
- PDF Engine
- A newly designed PDF engine is introduced that works across platforms. Since this engine is not yet at parity with the feature-set of the old PDF engine, it is enabled only for .NET Standard. So it is used by the new Xamarin document viewer control and also can be used in .NET Core / .NET Core 3 frameworks. Once its feature-set matches that of the old PDF engine, the old engine will be replaced with the new one for all platforms.
- Features currently supported by the new PDF engine
- Shapes
- Text
- Supported font types: Type 1, TrueType, CFF, CID
- Images
- Supported image types: 8 BPC BMP and 8 BPC JPEG
- Decryption
- Supported encryption algorithms: RC4 (40 bit and 128 bit) and AES (128 bit and 256 bit)
- Stream decompression filter types:
- Flate
- Xamarin Document Viewer Control
- A new Document Viewer control is introduced for Xamarin.Forms mobile apps (Android and iOS).
- Features currently supported the control
- Page layouts
- Continuous: Single-page, Two-page, Fit-to-window
- Scroll orientation
- Vertical, Horizontal
- Navigation
- First-page, Previous-page, Next-page, Last-page, Goto-page
- Zoom
- Zoom-in, Zoom-out, Fit-width, Fit-height, Custom-zoom (unlimited)
- Rotation
- Specific pages and all pages
- Gestures
- Swipe, Pan and Double-tap.
- Events
- PageChanged
- PageCountChanged
- ZoomChanged
- NeedPassword
- Product Licensing
- The old registration-key based product licensing system has been replaced with an activation-key based licensing system. See the documentation for more details.

Enhanced
~~~~~~~~
- Document Engines
- All existing document engines (except PDF) are now supported on .NET Standard compliant frameworks (.NET Core and Xamarin as well).
- Updated file format prediction code to improve detection of supported files.
- Digitization Engine
- The digitization engine (OCR) is now supported on .NET Standard compliant frameworks as well.
- Document Converter
- The Document Converter component now works both in .NET Framework as well as .NET Standard.
- Document Mail-merge
- The Mail-merge component now works both in .NET Framework as well as .NET Standard.
- Document Viewers
- Both the WinForms and WPF viewer controls have been redesigned and rewritten from the ground-up.
- The newly introduced Xamarin Document Viewer control, and the rewritten WinForms and WPF Document Viewer controls share bulk of their code via the newly introduced PageManager module. This new design allows for faster introduction of features into all viewers at the same time.
- The WPF viewer now supports navigation via thumbnails as well as PDF bookmarks.
- Word formats
- Header and footer content is now rendered with a faded color to match Word behavior.
- Spreadsheet formats
- File is now read in a worker thread and content is rendered to pages

Fixed
~~~~~~
- PDF
- DCT images having Separation Colorspace with Type 0 (Sampled) Function are rendered with inverted colors..
- JPX image having JPX image as SMask are not rendered.
- Shapes having Separation Colorspace with Type 0 (Sampled) Function are rendered incorrectly.
- Rendering DCT images having JPX image as SMask.
- Rendering of image that is compressed using Flate with Predictor value '15' takes longer than expected.
- Text extraction results in extraction of incorrect text for CID font text when the CID font contains ToUnicode stream with "beginbfrange" lines that specify array of Unicode values for the given range.
- Random errors observed in appearance of transparent text when converting to PDF.
- Deletion of annotations in the document leads to generation of corrupt PDF output.
- NRE thrown when fetching form fields for some PDF files.
- Form fields with unspecified field type are rendered twice.
- Text fields are not displayed after saving due to incorrect encoding text field value for some documents.
- Form fields are rendered at incorrect position due to incorrect handling of Matrix entry in the field appearance stream.
- Multiple embedded font text rendering fixes by the font engine.
- Word formats
- Picture watermarks are not being rendered correctly.
- NRE thrown when loading some DOCX files containing fields.
- NRE thrown when loading TXT files containing null characters.
- Rendering issue of fields where a width change isn’t accounted for when the field value is changed.
- NRE when rendering some DOC files with a table.
- Tab stops of type “start” are not handled.
- Hang when rendering a table containing a page break.
- Highlight color not parsed properly in DOC files.
- Spreadsheet formats
- Rich text parsing issue in XLS.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories

This package is not used by any popular GitHub repositories.

Version History

Version Downloads Last updated
21.1.870 101 2/4/2021
21.1.867 86 2/3/2021
20.1.629 254 6/16/2020
19.3.0 222 11/25/2019
19.2.1 220 9/24/2019
19.2.0 225 9/23/2019