EchoSharp.Whisper.net
0.1.0
dotnet add package EchoSharp.Whisper.net --version 0.1.0
NuGet\Install-Package EchoSharp.Whisper.net -Version 0.1.0
<PackageReference Include="EchoSharp.Whisper.net" Version="0.1.0" />
paket add EchoSharp.Whisper.net --version 0.1.0
#r "nuget: EchoSharp.Whisper.net, 0.1.0"
// Install EchoSharp.Whisper.net as a Cake Addin #addin nuget:?package=EchoSharp.Whisper.net&version=0.1.0 // Install EchoSharp.Whisper.net as a Cake Tool #tool nuget:?package=EchoSharp.Whisper.net&version=0.1.0
EchoSharp
EchoSharp is an open-source library designed for near-real-time audio processing, orchestrating different AI models seamlessly for various audio analysis scopes. With an architecture that focuses on flexibility and performance, EchoSharp allows near-real-time Transcription and Translation by integrating components for Speech-to-Text and Voice Activity Detection.
Key Features
- Near-Real-Time Audio Processing: Handle audio data with minimal latency, ensuring efficient near-real-time results.
- Interchangeable Components: Customize or extend the library by building your own components for speech-to-text or voice activity detection. EchoSharp exposes flexible interfaces, making integration straightforward.
- Easy Orchestration: Manage and coordinate different AI models effectively for specific audio analysis tasks, like transcribing and detecting speech in various environments.
Get Started
Get started with EchoSharp and explore how adaptable, near-real-time audio processing can transform your projects.
You can find the latest EchoSharp version on nuget at: EchoSharp
First-Party components
EchoSharp.Whisper.net
EchoSharp.Whisper.net is a Speech-to-Text (STT) component built on top of Whisper.net, providing high-quality transcription and translation capabilities in a near-real-time setting. Leveraging the state-of-the-art Whisper models from OpenAI, this component ensures robust performance for processing audio input with impressive accuracy across multiple languages. It's designed to be highly efficient and easily interchangeable, allowing developers to customize or extend it with alternative STT components if desired.
Key Features:
- Multilingual Transcription: Supports transcription in multiple languages, with automatic detection and translation capabilities.
- Customizable Integration: Plug-and-play design that integrates seamlessly with EchoSharp's audio orchestration.
- Local Inference: Perform inference locally, ensuring data privacy and reducing latency for near-real-time processing.
EchoSharp.Onnx.SileroVad
EchoSharp.Onnx.SileroVad is a Voice Activity Detection (VAD) component that uses Silero VAD to distinguish between speech and non-speech segments in audio streams. By efficiently detecting voice activity, this component helps manage and optimize audio processing pipelines, activating transcription only when necessary to reduce overhead and improve overall performance.
Key Features:
- Accurate Voice Detection: Reliably identifies when speech is present, even in noisy environments.
- Resource Efficiency: Minimizes unnecessary processing by filtering out silent or irrelevant audio segments.
- Flexible Configuration: Easily adjustable settings to fine-tune voice detection thresholds based on specific use cases.
EchoSharp.OpenAI.Whisper
EchoSharp.OpenAI.Whisper is a Speech-to-Text (STT) component that leverages the OpenAI Whisper API.
Key Features:
- High-Quality Transcription: Utilizes the OpenAI Whisper API to provide accurate and reliable speech-to-text conversion.
- Azure or OpenAI APIs: Choose between Azure or OpenAI APIs for transcription based on your requirements. (just provide the AudioClient from OpenAI SDK or Azure SDK)
- Customizable Integration: Easily integrate with EchoSharp's audio orchestration for seamless audio processing.
EchoSharp.AzureAI.SpeechServices
EchoSharp.AzureAI.SpeechServices is a Speech-to-Text (STT) component that uses the Azure Speech Services API.
Key Features:
- Azure Speech Services Integration: Leverage the Azure Speech Services API for high-quality speech-to-text conversion.
- Real-Time Transcription: Process audio data in near-real-time with minimal latency.
- Customizable Configuration: Easily adjust settings and parameters to optimize transcription performance.
EchoSharp.WebRtc.WebRtcVadSharp
EchoSharp.WebRtc.WebRtcVadSharp is a Voice Activity Detection (VAD) component that uses the WebRTC VAD and WebRtcVadSharp algorithm to detect voice activity in audio streams. By accurately identifying speech segments, this component helps optimize audio processing pipelines, reducing unnecessary processing and improving overall efficiency.
Key Features:
- Efficient Voice Detection: Detects voice activity with high accuracy, even in noisy environments.
- Resource Optimization: Filters out silent or irrelevant audio segments to minimize processing overhead.
- Flexible Configuration: Easily adjust settings to fine-tune voice detection OperatingMode based on specific use cases.
EchoSharp.Onnx.Whisper
Experimental - This component is still in development and may not be suitable for production use.
EchoSharp.Onnx.Whisper is a Speech-to-Text (STT) component that uses an ONNX model for speech recognition.
Key Features:
- Customizable Speech Recognition: Utilize your own Whisper ONNX model for speech-to-text conversion.
- Local Inference: Perform speech recognition locally, ensuring data privacy and reducing latency.
- Flexible Integration: Seamlessly integrate with EchoSharp's audio processing pipeline for efficient audio analysis.
EchoSharp.Onnx.Sherpa
EchoSharp.Onnx.Sherpa is a Speech-to-Text (STT) component that uses multiple ONNX models for speech recognition. It integrates with this sherpa-onnx project and supports both OnlineModels and OfflineModels. Key Features:
- Customizable Speech Recognition: Utilize your own ONNX models for speech-to-text conversion.
- Local Inference: Perform speech recognition locally, ensuring data privacy and reducing latency.
- Flexible Integration: Seamlessly integrate with EchoSharp's audio processing pipeline for efficient audio analysis.
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 is compatible. net9.0-android was computed. net9.0-browser was computed. net9.0-ios was computed. net9.0-maccatalyst was computed. net9.0-macos was computed. net9.0-tvos was computed. net9.0-windows was computed. |
.NET Core | netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed. |
.NET Standard | netstandard2.0 is compatible. netstandard2.1 was computed. |
.NET Framework | net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed. |
MonoAndroid | monoandroid was computed. |
MonoMac | monomac was computed. |
MonoTouch | monotouch was computed. |
Tizen | tizen40 was computed. tizen60 was computed. |
Xamarin.iOS | xamarinios was computed. |
Xamarin.Mac | xamarinmac was computed. |
Xamarin.TVOS | xamarintvos was computed. |
Xamarin.WatchOS | xamarinwatchos was computed. |
-
.NETStandard 2.0
- EchoSharp.Abstractions (>= 0.1.0)
- Microsoft.Bcl.AsyncInterfaces (>= 9.0.0)
- System.IO.Compression (>= 4.3.0)
- System.Memory (>= 4.5.5)
- Whisper.net (>= 1.7.2)
-
net8.0
- EchoSharp.Abstractions (>= 0.1.0)
- Whisper.net (>= 1.7.2)
-
net9.0
- EchoSharp.Abstractions (>= 0.1.0)
- Whisper.net (>= 1.7.2)
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
0.1.0 | 125 | 12/26/2024 |