Bytescout.PDFExtractor 11.2.1.3959

Bytescout PDF Extractor SDK for .NET, ASP.NET, ActiveX - extract data from PDF documents

Install-Package Bytescout.PDFExtractor -Version 11.2.1.3959
dotnet add package Bytescout.PDFExtractor --version 11.2.1.3959
<PackageReference Include="Bytescout.PDFExtractor" Version="11.2.1.3959" />
For projects that support PackageReference, copy this XML node into the project file to reference the package.
paket add Bytescout.PDFExtractor --version 11.2.1.3959
The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Release Notes

Bytescout PDF Extractor SDK for .NET, ASP.NET, ActiveX.

ByteScout, Inc. (c) 2008-2020.

Compatibility: .NET Framework 2.0 or later; .NET Core 2.0 or later.
Works with: .NET, ASP.NET, ActiveX, Visual Basic 6, Classic ASP, Delphi and others.

Features:

- Extracts data from PDF files in TXT, CSV, XML, XLS, XLSX, JSON formats;
- Extracts embedded images, files and attachments from PDF files;
- Splits and merges PDF files, extracts a single page or range of pages;
- Extracts data from whole document page or specified rectangular region;
- Extracts PDF document information (author, subject, producer etc);
- Detects tables;
- Searches text inside document with regex support;
- Extracts data from PDF forms;
- Reads text from scanned PDF documents using OCR (Optical Character Recognition);
- Provides ActiveX interface to use from legacy programming languages (Visual Basic 6, Delphi) and scripting (VBscript, JScript and others);
- And much more...

History of changes:

11.2.0.3919 (June 20, 2020)
===========================
+ 'MultimediaExtractor' now supports extraction of 3D-animation objects.
- 'TextExtractor.Find()' now keeps original font names in found object information.
= Improved column detection in `ColumnDetectionMode.Borders` mode.
- 'SearchablePDFMaker' did not process vector-only pages. Fixed now.
= Improved regex text search in 'TextExtractor'.
+ Added 'DetectUnderlineTextStyle' and 'DetectStrikeoutTextStyle' properties to 'JSONExtractor' and 'XMLExtractor'.
+ Added 'OCRWhiteList' and 'OCRBlackList' properties to extractors.
+ Added 'Invert' OCR preprocessing filter.
+ Added 'Scale' OCR preprocessing filter.
= Improved joining of multi-line cells in tables without borders (`LineGroupingMode.JoinOrphanedRows` mode).
= Improved performance of 'ImageExtractor'.
+ Added page rectangles to 'InfoExtractor'.
= Improved 'OCRAnalyzer'.
= Improved automatic deletion of duplicated text objects during the extraction.
- Fixed extraction issues in .NET Core version.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

11.1.0.3845 (March 19, 2020)
============================
+ Added 'OCROverallConfidence' property in all extractors that.
+ SearchablePDFMaker: Added 'KeepOriginalRotation' property.
- SearchablePDFMaker: fixed crash on mixed English-Arabic text recognition.
+ PDF Multitool: Added "Developer Tools" sub-menu to the context menu.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

11.0.0.3805 (February 11, 2020)
===============================
+ Added support for new revision of PDF encryption (ISO 32000-2:2017 compliance).
+ Added 'LicenseInfo' property providing detailed information about your license.
+ Added 'Grayscale' filter to OCRImagePreprocessingFilters.
= Dramatically improved column extraction for multiple tables on a page. Works only in `ColumnDetectionMode.Borders` mode for tables with borders between columns and rows.
= Greatly improved `ColumnDetectionMode.BorderedTables`. As in the table detection, it now uses optical recognition to detect bordered tables and their columns on scanned documents.
= Improved 'InfoExtractor' to return the encrypted and password-protected states without asking a password or throwing an exception.
= Added document permissions information to 'InfoExtractor'.
= DocumentSplitter: added zero-padding to page numbers in generated file names.
= Improved extraction of duplicated text (shadow-like effect).
= Improved 'MultimediaExtractor'.
- Fixed text search issues on some documents.
- Fixed bug that damaged extracted text only during multi-thread processing.
- Fixed crash on subsequent extractions with different OCR modes.
- Fixed .NET Core compatibility issue.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

10.8.0.3732 (December 4, 2019)
==============================
+ Remover2: Added 'MaskColor' property that allows to change color of masking rectangle.
- Remover and Remover2: Fixed incomplete removal of the text in some cases.
- XMLExtractor and XFDFExtractor: fixed missing control types.
- Fixed parsing of combobox items that consist of value+label pairs.
= Improved handling of Arabic fonts and charsets.
= Improved handling of CJK fonts and charsets.
= Improved parsing of PDF documents.
- Other minor fixes and improvements.

...

NuGet packages (1)

Showing the top 1 NuGet packages that depend on Bytescout.PDFExtractor:

Package Downloads
BizDoc.Invy
See https://www.npmjs.com/package/bizdoc.core.invy

GitHub repositories

This package is not used by any popular GitHub repositories.

Version History

Version Downloads Last updated
11.2.1.3959 361 9/1/2020
11.2.1.3929 422 7/14/2020
11.2.1.3926 117 7/9/2020
11.2.0.3919 178 6/30/2020
11.1.0.3869 2,259 4/10/2020
11.1.0.3864 274 4/4/2020
11.1.0.3849 327 3/27/2020
11.1.0.3845 312 3/19/2020
11.0.0.3834 439 3/6/2020
11.0.0.3832 187 3/4/2020
11.0.0.3830 170 3/4/2020
11.0.0.3815 358 2/21/2020
11.0.0.3805 405 2/11/2020
10.8.0.3758 1,087 12/19/2019
10.8.0.3750 249 12/17/2019
10.8.0.3744 201 12/12/2019
10.8.0.3741 157 12/10/2019
10.8.0.3736 269 12/6/2019
10.8.0.3732 204 12/4/2019
10.7.2.3710 543 11/13/2019
10.7.1.3705 200 11/11/2019
10.7.0.3697 320 11/2/2019
10.6.0.3666 943 10/1/2019
10.5.0.3637 945 9/2/2019
10.4.0.3618 631 8/15/2019
10.4.0.3613 255 8/13/2019
10.4.0.3602 324 8/7/2019
10.3.0.3566 832 7/2/2019
10.2.0.3548 844 6/13/2019
10.2.0.3534 237 6/11/2019
10.2.0.3525 255 6/7/2019
10.2.0.3514 291 5/28/2019
10.1.0.3444 696 4/5/2019
10.1.0.3439 273 4/4/2019
10.0.0.3429 340 3/25/2019
10.0.0.3427 258 3/25/2019
10.0.0.3424 265 3/23/2019
10.0.0.3423 240 3/23/2019
10.0.0.3422 251 3/23/2019
10.0.0.3421 306 3/21/2019
9.4.0.3398 364 3/12/2019
9.3.0.3366 680 2/12/2019
9.3.0.3357 357 2/4/2019
9.3.0.3354 260 1/31/2019
9.2.0.3293 1,194 11/20/2018
9.2.0.3262 580 10/24/2018
9.2.0.3259 322 10/24/2018
9.1.0.3170 925 7/26/2018
9.1.0.3167 491 7/18/2018
9.1.0.3165 385 7/18/2018
9.1.0.3163 446 7/18/2018
9.0.0.3095 1,566 4/23/2018
9.0.0.3087 686 4/13/2018
9.0.0.3080 497 4/11/2018
8.8.1.3046 917 2/20/2018
8.8.1.3025 1,129 1/29/2018
8.8.0.3021 541 1/23/2018
8.7.0.2981 2,152 11/8/2017
8.6.0.2917 1,484 8/2/2017
8.6.0.2912 467 8/1/2017
8.5.0.2863 695 6/9/2017
8.5.0.2861 532 6/8/2017
8.5.0.2856 555 6/1/2017
8.4.1.2829 4,773 4/12/2017
8.4.0.2821 544 3/29/2017
8.3.0.2809 863 3/13/2017
8.3.0.2806 481 3/12/2017
8.3.0.2803 479 3/6/2017
8.3.0.2801 464 3/6/2017
8.3.0.2800 469 3/6/2017
8.3.0.2798 454 3/6/2017
8.3.0.2796 471 3/6/2017
8.3.0.2794 472 3/6/2017
8.2.0.2699 867 1/11/2017
8.1.1.2606 1,330 10/25/2016
8.1.0.2600 538 10/21/2016
8.0.0.2542 727 9/1/2016
8.0.0.2541 511 9/1/2016
8.0.0.2528 563 8/23/2016
8.0.0.2523 508 8/19/2016
7.0.0.2493 23,610 6/27/2016
7.0.0.2489 466 6/27/2016
7.0.0.2480 1,107 6/10/2016
7.0.0.2474 814 5/26/2016
6.30.0.2421 705 3/24/2016
6.20.0.2354 722 1/20/2016
6.12.0.2239 3,479 9/22/2015
5.20.0.1871 1,201 2/5/2015
5.0.0.1626 1,236 8/14/2014
4.0.0.1487 764 5/31/2014
3.40.0.1349 894 3/11/2014
3.20.0.1092 895 8/5/2013
3.20.0.1075 1,568 7/12/2013
3.10.0.1051 773 6/29/2013
3.0.0.839 853 3/26/2013
2.50.0.769 847 2/25/2013