Edi.WordFilter
2.5.0
dotnet add package Edi.WordFilter --version 2.5.0
NuGet\Install-Package Edi.WordFilter -Version 2.5.0
<PackageReference Include="Edi.WordFilter" Version="2.5.0" />
paket add Edi.WordFilter --version 2.5.0
#r "nuget: Edi.WordFilter, 2.5.0"
// Install Edi.WordFilter as a Cake Addin #addin nuget:?package=Edi.WordFilter&version=2.5.0 // Install Edi.WordFilter as a Cake Tool #tool nuget:?package=Edi.WordFilter&version=2.5.0
Edi.WordFilter
Basic word filter used in my blog system to filter dirty words (e.g. insulting languages, impertinent words)
Install
dotnet add package Edi.WordFilter
Usage
Prepare a text file with banned words, for example splitted by |
. Like this:
fuck|shit|ass
Use it like this
var wordFilterDataFilePath = $"{AppDomain.CurrentDomain.GetData(Constants.DataDirectory)}\\BannedWords.txt";
var filter = new TrieTreeWordFilter(wordFilterDataFilePath);
var output = filter.FilterContent("Go fuck yourself and eat some shit!");
// output: Go **** yourself and eat some ****!
Design Details
TrieTreeWordFilter
The FilterContent()
method in the TrieTreeWordFilter
class is used to filter sensitive words from a given string content. It uses a Trie data structure to efficiently find and replace sensitive words with asterisks (*).
This method is efficient for filtering sensitive words, especially when there is a large set of words to filter. However, it assumes that the Trie tree has been properly initialized with all the sensitive words.
A Trie, also known as a prefix tree, is a tree-like data structure that is used to store a collection of strings. Each node of the Trie represents a character of a string and the root of the Trie represents an empty string or the start of a string. The strings are stored in a way that all the descendants of a node have a common prefix of the string associated with that node. Here's a simple example of how a Trie might look when storing the words "car", "cat", and "dog":
root
├── c
│ ├── a
│ │ ├── r
│ │ └── t
└── d
└── o
└── g
In this example, each path from the root to a node represents a string. For instance, the path from the root to the node 'r' represents the string "car".
Tries are particularly useful for operations that involve prefix matching, such as autocomplete features in text editors or web browsers, as they allow for efficient retrieval of all keys with a given prefix. They are also used in word filtering, as in the FilterContent() method you asked about earlier.
However, Tries can be memory-intensive, as each node may need to store pointers to many children. There are variations of the Trie data structure, such as the compressed Trie (also known as a Radix tree or Patricia tree), which help to mitigate this issue by merging nodes with a single child.
免责申明
此项目(Edi.WordFilter)及其配套组件均为免费开源的产品,仅用于学习交流,并且不直接向中国提供服务,中国访客请于下载后立即删除。
任何中国境内的组织及个人不得使用此项目(Edi.WordFilter)及其配套组件构建任何形式的面向中国境内访客的网站或服务。
不可用于任何违反中华人民共和国(含台湾省)或使用者所在地区法律法规的用途。
因为作者即本人仅完成代码的开发和开源活动(开源即任何人都可以下载使用),从未参与访客的任何运营和盈利活动。
且不知晓访客后续将程序源代码用于何种用途,故访客使用过程中所带来的任何法律责任即由访客自己承担。
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net8.0 is compatible. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. net9.0 is compatible. |
-
net8.0
- No dependencies.
-
net9.0
- No dependencies.
NuGet packages
This package is not used by any NuGet packages.
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated |
---|---|---|
2.5.0 | 142 | 11/13/2024 |
2.4.0 | 133 | 8/27/2024 |
2.3.0 | 309 | 1/3/2024 |
2.2.0 | 131 | 12/17/2023 |
2.1.0 | 110 | 12/15/2023 |
2.0.1 | 108 | 12/14/2023 |
2.0.0 | 121 | 12/14/2023 |
1.8.0 | 157 | 10/11/2023 |
1.7.1 | 116 | 10/10/2023 |
1.7.0 | 2,403 | 11/9/2022 |
1.6.1 | 2,844 | 8/6/2022 |
1.6.0 | 1,663 | 10/21/2021 |
1.5.0 | 430 | 9/14/2021 |
1.4.0 | 3,375 | 12/18/2020 |
1.3.1 | 576 | 11/27/2020 |
1.3.0 | 580 | 11/11/2020 |
1.2.3 | 2,417 | 2/25/2020 |
1.2.2 | 682 | 12/5/2019 |
1.2.1 | 652 | 10/21/2019 |
1.2.0 | 1,132 | 4/23/2019 |
1.1.0 | 954 | 12/2/2018 |
1.0.0 | 717 | 11/7/2018 |