Microsoft.ML.Tokenizers.Data.Cl100kBase
0.22.0-preview.24522.7
Prefix Reserved
See the version list below for details.
dotnet add package Microsoft.ML.Tokenizers.Data.Cl100kBase --version 0.22.0-preview.24522.7
NuGet\Install-Package Microsoft.ML.Tokenizers.Data.Cl100kBase -Version 0.22.0-preview.24522.7
<PackageReference Include="Microsoft.ML.Tokenizers.Data.Cl100kBase" Version="0.22.0-preview.24522.7" />
paket add Microsoft.ML.Tokenizers.Data.Cl100kBase --version 0.22.0-preview.24522.7
#r "nuget: Microsoft.ML.Tokenizers.Data.Cl100kBase, 0.22.0-preview.24522.7"
// Install Microsoft.ML.Tokenizers.Data.Cl100kBase as a Cake Addin #addin nuget:?package=Microsoft.ML.Tokenizers.Data.Cl100kBase&version=0.22.0-preview.24522.7&prerelease // Install Microsoft.ML.Tokenizers.Data.Cl100kBase as a Cake Tool #tool nuget:?package=Microsoft.ML.Tokenizers.Data.Cl100kBase&version=0.22.0-preview.24522.7&prerelease
About
The Microsoft.ML.Tokenizers.Data.Cl100kBase
includes the Tiktoken tokenizer data file cl100k_base.tiktoken
, which is utilized by models such as GPT-4.
Key Features
- This package mainly contains the cl100k_base.tiktoken file, which is used by the Tiktoken tokenizer. This data file is used by the following models: 1. gpt-4 2. gpt-3.5-turbo 3. gpt-3.5-turbo-16k 4. gpt-35 5. gpt-35-turbo 6. gpt-35-turbo-16k 7. text-embedding-ada-002 8. text-embedding-3-small 9. text-embedding-3-large
How to Use
Reference this package in your project to use the Tiktoken tokenizer with the specified models.
// Create a tokenizer for the specified model or any other listed model name
Tokenizer tokenizer = TiktokenTokenizer.CreateForModel("gpt-4");
// Create a tokenizer for the specified encoding
Tokenizer tokenizer = TiktokenTokenizer.CreateForEncoding("cl100k_base");
Main Types
Users shouldn't use any types exposed by this package directly. This package is intended to provide tokenizer data files.
Additional Documentation
Related Packages
Microsoft.ML.Tokenizers
Feedback & Contributing
Microsoft.ML.Tokenizers.Data.Cl100kBase is released as open source under the MIT license. Bug reports and contributions are welcome at the GitHub repository.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- Microsoft.ML.Tokenizers (>= 0.22.0-preview.24522.7)
NuGet packages (2)
Showing the top 2 NuGet packages that depend on Microsoft.ML.Tokenizers.Data.Cl100kBase:
Package | Downloads |
---|---|
Microsoft.KernelMemory.AI.Tiktoken
Provide tokenizers to allow counting content tokens for text and embeddings |
|
Microsoft.Teams.AI
SDK focused on building AI based applications for Microsoft Teams. |
GitHub repositories (4)
Showing the top 4 popular GitHub repositories that depend on Microsoft.ML.Tokenizers.Data.Cl100kBase:
Repository | Stars |
---|---|
microsoft/semantic-kernel
Integrate cutting-edge LLM technology quickly and easily into your apps
|
|
microsoft/kernel-memory
RAG architecture: index and query any data using LLM and natural language, track sources, show citations, asynchronous memory patterns.
|
|
axzxs2001/Asp.NetCoreExperiment
原来所有项目都移动到**OleVersion**目录下进行保留。新的案例装以.net 5.0为主,一部分对以前案例进行升级,一部分将以前的工作经验总结出来,以供大家参考!
|
|
sdcb/chats
Flexible frontend for managing and deploying language models.
|
Version | Downloads | Last updated |
---|---|---|
1.0.0 | 14,545 | 11/14/2024 |
0.22.0 | 339 | 11/13/2024 |
0.22.0-preview.24526.1 | 1,283 | 10/27/2024 |
0.22.0-preview.24522.7 | 1,566 | 10/23/2024 |