Microsoft.ML.Tokenizers.Data.P50kBase
1.0.0
Prefix Reserved
dotnet add package Microsoft.ML.Tokenizers.Data.P50kBase --version 1.0.0
NuGet\Install-Package Microsoft.ML.Tokenizers.Data.P50kBase -Version 1.0.0
<PackageReference Include="Microsoft.ML.Tokenizers.Data.P50kBase" Version="1.0.0" />
paket add Microsoft.ML.Tokenizers.Data.P50kBase --version 1.0.0
#r "nuget: Microsoft.ML.Tokenizers.Data.P50kBase, 1.0.0"
// Install Microsoft.ML.Tokenizers.Data.P50kBase as a Cake Addin #addin nuget:?package=Microsoft.ML.Tokenizers.Data.P50kBase&version=1.0.0 // Install Microsoft.ML.Tokenizers.Data.P50kBase as a Cake Tool #tool nuget:?package=Microsoft.ML.Tokenizers.Data.P50kBase&version=1.0.0
About
The Microsoft.ML.Tokenizers.Data.P50kBase
includes the Tiktoken tokenizer data file p50k_base.tiktoken
, which is utilized by models such as text-davinci-002
.
Key Features
- This package mainly contains the
p50k_base.tiktoken
file, which is used by the Tiktoken tokenizer. This data file is used by the following models: 1. text-davinci-002 2. text-davinci-003 3. code-davinci-001 4. code-davinci-002 5. code-cushman-001 6. code-cushman-002 7. davinci-codex 8. cushman-codex
How to Use
Reference this package in your project to use the Tiktoken tokenizer with the specified models.
// Create a tokenizer for the specified model or any other listed model name
Tokenizer tokenizer = TiktokenTokenizer.CreateForModel("text-davinci-002");
// Create a tokenizer for the specified encoding
Tokenizer tokenizer = TiktokenTokenizer.CreateForEncoding("p50k_base");
Main Types
Users shouldn't use any types exposed by this package directly. This package is intended to provide tokenizer data files.
Additional Documentation
Related Packages
Microsoft.ML.Tokenizers
Feedback & Contributing
Microsoft.ML.Tokenizers.Data.P50kBase is released as open source under the MIT license. Bug reports and contributions are welcome at the GitHub repository.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- Microsoft.ML.Tokenizers (>= 1.0.0)
NuGet packages (1)
Showing the top 1 NuGet packages that depend on Microsoft.ML.Tokenizers.Data.P50kBase:
Package | Downloads |
---|---|
Microsoft.KernelMemory.AI.Tiktoken
Provide tokenizers to allow counting content tokens for text and embeddings |
GitHub repositories (1)
Showing the top 1 popular GitHub repositories that depend on Microsoft.ML.Tokenizers.Data.P50kBase:
Repository | Stars |
---|---|
microsoft/kernel-memory
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
|
Version | Downloads | Last updated |
---|---|---|
1.0.0 | 1,642 | 11/14/2024 |
0.22.0 | 115 | 11/13/2024 |
0.22.0-preview.24526.1 | 116 | 10/27/2024 |
0.22.0-preview.24522.7 | 172 | 10/23/2024 |