TikaOnDotnet.TextExtractor
1.12.2
See the version list below for details.
dotnet add package TikaOnDotnet.TextExtractor --version 1.12.2
NuGet\Install-Package TikaOnDotnet.TextExtractor -Version 1.12.2
<PackageReference Include="TikaOnDotnet.TextExtractor" Version="1.12.2" />
paket add TikaOnDotnet.TextExtractor --version 1.12.2
#r "nuget: TikaOnDotnet.TextExtractor, 1.12.2"
// Install TikaOnDotnet.TextExtractor as a Cake Addin #addin nuget:?package=TikaOnDotnet.TextExtractor&version=1.12.2 // Install TikaOnDotnet.TextExtractor as a Cake Tool #tool nuget:?package=TikaOnDotnet.TextExtractor&version=1.12.2
Example of how to use **TikaOnDotNet** for text extraction from rich documents.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET Framework | net is compatible. |
-
- TikaOnDotnet (>= 1.12.2 && < 1.13.0)
NuGet packages (6)
Showing the top 5 NuGet packages that depend on TikaOnDotnet.TextExtractor:
Package | Downloads |
---|---|
Contrib.Sitecore.ContentSearch.TikaOnDotnet
Contribution project for Sitecore ContentSearch |
|
DevelopmentHelpers.FileContentReader
This package combine many open sources packages and allow one interface to read may types of content files. for example:use open.xml to read docx file |
|
Cogworks.ExamineFileIndexer
An examine indexer that uses Apache TIKA |
|
Skybrud.Umbraco.Search.DocumentIndexer
This package makes it possible to index and search a wide variety of filetypes in Umbraco, including .pdf and .docx |
|
Jetsons.JetPack.Text
The wrapper library that provides smart extension methods to convert document formats to high quality text. |
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
1.17.1 | 520,798 | 4/3/2018 |
1.17.0 | 30,612 | 2/15/2018 |
1.16.0 | 163,602 | 7/30/2017 |
1.15.0 | 8,656 | 7/30/2017 |
1.14.2 | 114,481 | 4/22/2017 |
1.14.2-pre | 3,370 | 4/15/2017 |
1.14.1 | 18,664 | 1/13/2017 |
1.14.0 | 10,011 | 12/8/2016 |
1.13.1 | 10,615 | 8/16/2016 |
1.13.0 | 15,532 | 6/30/2016 |
1.12.2 | 17,874 | 4/12/2016 |
1.12.1 | 1,614 | 4/12/2016 |
1.12.0 | 1,750 | 4/11/2016 |
- Breaking Change: Renamed the namespace and assembly name of TikaOnDotNet to match the Nuget id (was `tika-app`). This should only affect the resulting filename of the assembly. All Tika code is namespaced with a Java style (com.apache.{yadda yadda}).
- Fix TextExtractor dependency so that it is using a "working" version of TikaOnDotNet (1.12.2)