Recyclable.Collections
0.0.3.3-alpha
See the version list below for details.
dotnet add package Recyclable.Collections --version 0.0.3.3-alpha
NuGet\Install-Package Recyclable.Collections -Version 0.0.3.3-alpha
<PackageReference Include="Recyclable.Collections" Version="0.0.3.3-alpha" />
paket add Recyclable.Collections --version 0.0.3.3-alpha
#r "nuget: Recyclable.Collections, 0.0.3.3-alpha"
// Install Recyclable.Collections as a Cake Addin #addin nuget:?package=Recyclable.Collections&version=0.0.3.3-alpha&prerelease // Install Recyclable.Collections as a Cake Tool #tool nuget:?package=Recyclable.Collections&version=0.0.3.3-alpha&prerelease
Recyclable.Collections
Recyclable.Collections
project is an open source framework for operating dynamic lists at performance close to raw arrays, but fairly unlimited in size. It aims at providing minimal memory footprint. It implements IList<T>
interface and is targeted as direct replacements of List<T>
, SortableList<T>
, PriorityQueue<T>
& similar.
Included
RecyclableList<T>
RecyclableLongList<T>
Milestones
- ✅ Create basic classes
- ✅
RecyclableList<T>
- ✅
RecyclableLongList<T>
- 🅿️
RecyclableQueue<T>
- 🅿️
RecyclableSortedList<T>
- 🅿️
RecyclableStack<T>
- 🅿️
RecyclableUnorderedList<T>
- ✅
- ✅ Create basic unit tests
- ✅
RecyclableList<T>
- ✅
RecyclableLongList<T>
- 🅿️
RecyclableQueue<T>
- 🅿️
RecyclableSortedList<T>
- 🅿️
RecyclableStack<T>
- 🅿️
RecyclableUnorderedList<T>
- ✅
- ✅ Optimize
RecyclableList<T>
- ✅
Add
- ✅
AddRange
- ✅ when source is
array<T>
- ✅ when source is
List<T>
- ✅ when source is
IList<T>
- ✅ when source is
RecyclableList<T>
- ✅ when source is
RecyclableLongList<T>
- ✅ when source is
IEnumerable<T>
- ✅ when source has non-enumerated count
- ✅ when source is
- ✅
Clear
- ✅
Contains
- ✅
CopyTo
- ✅
EnsureCapacity
- ✅
GetEnumerator
- ✅
IndexOf
- ✅
Insert
- ✅
Remove
- ✅
RemoveAt
- ✅
Resize
- ✅
this[int index]
- ✅
- ✅ Port
RecyclableList<T>
implementation toRecyclableLongList<T>
- 👉 Optimize
RecyclableLongList<T>
- ✅
Add
- ✅
AddRange
- ✅ when source is
array<T>
- ✅ when source is
List<T>
- ✅ when source is
IList<T>
- ✅ when source is
RecyclableList<T>
- ✅ when source is
RecyclableLongList<T>
- ✅ when source is
IEnumerable<T>
- ✅ when source has non-enumerated count
- ✅ when source is
- ✅
Clear
- ✅
Contains
- 🅿️
CopyTo
- ✅
EnsureCapacity
- ✅
GetEnumerator
- ✅
IndexOf
- ✅
LongIndexOf
- ✅
Insert
- ✅
Remove
- ✅
RemoveAt(int index)
- ✅
RemoveAt(long index)
- ✅
RemoveAt
- ✅
Resize
- ✅
this[int index]
- ✅
this[long index]
- ✅ Rename
RecyclableList<T>
toRecyclableLongList<T>
- ✅ Rename
RecyclableArrayList<T>
toRecyclableList<T>
- ✅ Fix failing tests
- ✅ Hide not ready classes
- ✅
RecyclableQueue<T>
- ✅
RecyclableSortedList<T>
- ✅
RecyclableStack<T>
- ✅
RecyclableUnorderedList<T>
- ✅
- ✅ Add support for
ReadOnlySpan<T>
- ✅ Release 0.0.3-alpha
- ✅ Implement
List<T>
interfaces- ✅
IList<T>
- ✅
ICollection<T>
- ✅
IEnumerable<T>
- ✅
IEnumerable
- ✅
IReadOnlyList<T>
- ✅
IReadOnlyCollection<T>
- ✅
IList
- ✅
ICollection
- ✅
- ✅ Implement list versioning to allow data change identification
- 👉 Make sure that
NeedsClearing
is used & items are cleared in- 🅿️
Clear
- 🅿️
Dispose
- 🅿️
Remove
- 🅿️
RemoveAt
- 🅿️
RemoveBlock
- 🅿️ Non-versioned & Versioned Enumerators
- 🅿️
- 🅿️ Add
.ToRecyclableList
/.ToRecyclableLongList
variants for all supported collection types- 🅿️
RecyclableList
- 🅿️
RecyclableLongList
- 🅿️
IList<T>
- 🅿️
ICollection<T>
- 🅿️
IEnumerable<T>
- 🅿️
IEnumerable
- 🅿️
IReadOnlyList<T>
- 🅿️
IReadOnlyCollection<T>
- 🅿️
IList
- 🅿️
ICollection
- 🅿️
- 🅿️ Release 0.0.3-beta
- 🅿️ Add support for
ulong
indexing- 🅿️ Convert
_memoryBlocks
toArray
to allowulong
lengths - 🅿️ Convert block indexes from
int
toulong
orlong
- 🅿️ Convert
- 🅿️ Overflow review
- 🅿️ Add type casting to
long
for<<
&>>
operations, where required - 🅿️ Make type castings
checked
- 🅿️ Add type casting to
- 🅿️ Port
RecyclableLongList<T>
optimizations toRecyclableList<T>
- 🅿️ Release 0.0.3
- ✅
- 🅿️ Implement
ILongList<T>
interface- 🅿️
RecyclableList<T>
- 🅿️
RecyclableLongList<T>
- 🅿️
- 🅿️ Implement
RecyclableQeueue<T>
- 🅿️ Port
RecyclableLongList<T>
optimizations toRecyclableQueue<T>
- 🅿️ Release 0.0.4
- 🅿️ Port
- 🅿️ Implement
RecyclableStack<T>
- 🅿️ Port
RecyclableLongList<T>
optimizations toRecyclableStack<T>
- 🅿️ Release 0.0.5
- 🅿️ Port
- 🅿️ Implement
RecyclableSortedList<T>
- 🅿️ Port
RecyclableLongList<T>
optimizations toRecyclableSortedList<T>
- 🅿️ Release 0.0.6
- 🅿️ Port
- 🅿️ Implement
RecyclableUnorderedList<T>
- 🅿️ Port
RecyclableLongList<T>
optimizations toRecyclableUnorderedList<T>
- 🅿️ Release 0.0.7
- 🅿️ Port
- 🅿️ Implement
RecyclableVersionedList<T>
- 🅿️ Port
RecyclableList<T>
toRecyclableVersionedList<T>
- 🅿️ Port
RecyclableLongList<T>
toRecyclableVersionedLongList<T>
- 🅿️ Release 0.0.7a
- 🅿️ Port
- 🅿️ Optimize
OneSizeArrayPool
- 🅿️ Review locks
- 🅿️ Measure multi-threading performance
- 🅿️ Implement memory bucket disposal in high RAM pressure scenario
- 🅿️ Optimize
RecyclableCollectionVersionObjectPool
- 🅿️ Implement custom
ObjectPool<T>
- 🅿️ Measure multi-threading performance
- 🅿️ Implement memory bucket disposal in high RAM pressure scenario
- 🅿️ Implement custom
- 🅿️ Review
RecyclableArrayPool
- 🅿️ Review locks
- 🅿️ Measure multi-threading performance
- 🅿️ Implement array disposal in high RAM pressure scenario
- 🅿️ Optimize
MemoryBucket<T>
- 🅿️ Convert to
struct
, if possible - 🅿️ Find out if there are better replacements
- 🅿️ Release 0.0.8
- 🅿️ Convert to
- 🅿️ Optimize
- 🅿️
IndexOfSynchronizationContext
- 🅿️
IndexOfSynchronizationContextPool
- 🅿️
ManualResetEventSlimmer
- 🅿️ Multi-threading benchmarks
- 🅿️
ManualResetEventSlimmerPool
- 🅿️ Multi-threading benchmarks
- 🅿️
SpinLockSlimmer
- 🅿️ Multi-threading benchmarks
- 🅿️
- 🅿️ Release 0.0.9-beta
- 🅿️ Extend unit tests
- 🅿️
.Add
/.AddRange
must allownull
values - 🅿️
.Remove
/.RemoveAt
/.Clear
must clear reference when reference type - 🅿️
RecyclableLongListExtensions.CopyTo
- 🅿️
- 🅿️ Cleanup
- 🅿️ Replace
LastBlockWithData
property with_lastBlockWithData
field - ✅
RecyclableLongListExtensions
- ✅
ListExtensions
- 🅿️
MathUtils
- 🅿️ Resolve TODOs
- 🅿️ Replace
- 🅿️ Optimize
- 🅿️
ListExtensions
- 🅿️
MathUtils
- 🅿️
SystemRandomNumberGenerator
- 🅿️
RecyclableLongListExtensions
- 🅿️
- 🅿️ Review and remove warnings & hints
- 🅿️ Warnings
- 🅿️ Hints
- Documentation
- 🅿️ Document the most efficient iteration over array collections
- 🅿️ Document the most efficient iteration over blocked collections
- 🅿️ Document differences in behavior
- 🅿️ Document other specifics
- 🅿️ Release 1.0.0
- 🅿️ Implement source code generators
- 🅿️ Add attributes
- 🅿️
GeneratorBaseClassAttribute
for marking base class used for generation - 🅿️
VersionedAttribute
for marking classes as versioned - 🅿️
IncVersionAttribute
for marking methods & setters as resulting in version increase
- 🅿️
- 🅿️ Implement source code generator
- 🅿️ Generate partial class for classes marked with
GeneratorBaseClassAttribute
- 🅿️ Add support for fields
- 🅿️ Add support for properties
- 🅿️ Add support for methods
- 🅿️ Add support for constructor
- 🅿️ Skip base fields, methods, properties etc. when they're overridden in the child class
- 🅿️ Generate partial class for classes marked with
- 🅿️ Add attributes
- 🅿️ Optimize
- 🅿️
RecyclableLongList<T>.Resize
- 🅿️
RecyclableLongList<T>.CopyTo
- 🅿️ Check if we can benefit from Sse2 in
.IndexOf
/.Contains
methods as given in MS blog.
- 🅿️
- 🅿️ Add support for
ICollection<T>
interface in.AddRange
&constructor
- 🅿️ Final optimizations
- 🅿️ Replace
Math
class usages withif
statements - 🅿️ Replace
a - b > 0
&a - b < 0
comparisons witha > b
&a < b
- 🅿️ Replace
a + b > 0
&a + b < 0
comparisons witha > b
&a < b
- 🅿️ Replace
a / b
&a * b
calculations with equivalents, where possible - 🅿️ Replace virtual calls with static calls
- 🅿️ Replace
blockSize
sums by powers of 2, minus 1 - 🅿️ Remove type castings, if possible
- 🅿️ Convert generic methods to non-generic
- 🅿️ Replace integer comparisons with comparisons to 0 / 1, if possible
- 🅿️ Replace
- 🅿️ Release 2.0.0
Characteristics of the classes
Common
- All classes implement
IDisposable
interface & SHOULD be disposed after use. That's to return block arrays taken from the shared pool. It may be foreseen as an issue for replacement in existing code, which obviously is missingusing
clause. But considering thatDispose
will be called byGC
anyway, it should cause issues in specific scenarios, only. In either case the fix is one-word addition ofusing
. - Memory blocks are created in 2 ways, depending on
blockSize
value- when
blockSize < 128
⇨ blocks are created as regulararrays
. - when
blockSize >= 128
⇨ blocks are taken from and returned toArrayPool<T>.Shared
, when blocks are removed. - maximum
blockSize
value is2_147_483_591
, which corresponds more or less to the maximum array size. blockSize
MUST be a power of 2. It will be rounded up to the closest power of 2, if needed. That is due to high performance gain on some operations, like the calculation of item index in a block.
- when
- Array pools are shared between the same
T
type. I.e.List<int>
will use a different pool fromList<short>
and so on. For high concurrency environments you may want to provide your own pools, when this feature becomes available in the upcoming releases. - Trying to access
this[int index]
,this[long index]
,Count
,LongCount
etc. whenCapacity == 0
will raiseNullReferenceException
. This is by design to remove all non-critical code from the constructor. - ⚠️ The default enumerators don't check if the collection was modified. That's by design due to high performance hit of the check. That should be fine in most use-cases. If you need to check for modifications, you're welcomed to use
.GetVersionedEnumerator()
/IRecyclableVersionedXxxList<T>
type-casting, e.g. inforeach
loops.
foreach (var item in (IRecyclableVersionedList<long>)recyclableList)
{
...
...
...
}
foreach (var item in (IRecyclableVersionedLongList<long>)recyclableLongList)
{
...
...
...
}
⚠️ The state of the default enumerators before the first & after the last call to
.MoveNext()
is undefined, but no exception is raised. That's by design due to high performance hit of the checks. This behavior is compatible withforeach
loops, which will never access.Current
property- when the collection is empty, or
- when
.MoveNext()
returnedfalse
, after reaching the end of the collection.
⚠️ If you decide to use
.GetEnumerator()
/.GetVersionedEnumerator()
, instead of relying onforeach
loops- you MUST ensure that you always respect the the result of
.MoveNext()
call, and - you MUST NOT access
.Current
property before calling.MoveNext()
or after reaching the end of the collection.
Failing to do so WILL result in unpredictable behavior.Benchmark results
- you MUST ensure that you always respect the the result of
RecyclableList<T>
- Range:
int
- Interfaces:
IList<T>
,IEnumerable<T>
,IDisposable
This is the direct equivalent ofList<T>
class, except that the arrays are taken from the shared pool, instead of directly being allocated. It proves to out performList<T>
& the standard arrays in certain operations and is fairly close in others. But they may perform worse in certain scenarios. If you're interested in details, please refer to benchmarks.
RecyclableLongList<T>
, RecyclableSortableList<T>
, RecyclableQueue<T>
, RecyclableStack<T>
Range:
long
Interfaces:
IList<T>
,ILongList<T>
,IEnumerable<T>
,IDisposable
All classes provide large storage capabilities, in practice limited only by the memory available in your system.
All data is stored in blocks, with the provided
int blockSize
or the default, currently being10,240
items in each block. It's recommended to use smaller numbers, if you foresee storing low no. of items on the list.There is no memory copying to increase the capacity of the recyclable classes, unless the no. of items exceeds the no. of allocated blocks. In that case, a new array of blocks is created to accommodate . If more capacity is needed, a new memory block is allocate & added to the internal list of blocks. This is the only list that may re-allocate memory block & copy their content to grow. That shouldn't be an issue considering limited no. of blocks stored by each class in practical scenarios. Currently I'm not planning any works around that. Shall there be a need to eliminate it, you're welcomed to file a PR with the proposed changes.
⚠️
IndexOf
will automatically switch to parallel search mode. Collections bigger than (currently) 850_000 items will be scanned using parallelization. In either case it'll return the index of the first matching item. This note also applies toContains
, because it internally usesIndexOf
. The value of 850_000 may become configurable in the future, unless it proves to result in much worse performance.The parallelization utilizes default
Task
scheduling, resulting in non-deterministic order in which search tasks are executed. In most cases it resulted in significant performance improvements. Below you can find the benchmarks for various item counts.As
RecyclableLongList<T>
is foreseen for use-cases with big data, you should automatically benefit from it in many use-cases.Benchmark Results
Insert
,Remove
&RemoveAt
methods raiseNotSupportedException
. This is by design in the current version, because it involves performance degradation. I'm planning works around this in the upcoming weeks. If you need that sooner, you're welcomed to file a PR with the proposed changes.
Architecture Decisions
The architecture of the classes included in this project is driven primarily by performance & memory footprints. The following decisions were made to support it.
Code duplication
There is high level of code duplication in each of the classes. When you look at them, you will see they are alike & some of the code could be extracted to extension methods. It has been done & has proven to perform worse. For this reason, properties & methods expected to be on the hot-path, duplicate much of the code to get the most out of it.
Common base - ILongList<T>
Having a common base class for all the classes looks very attractive. But it is also proved to perform worse, if you start making properties & methods virtual & there are subsequent calls to base
. For this reason, each of the classes is the root class & there are hardly any virtual methods & properties. All classes are meant to be final
, but I've not marked them as sealed
. If you feel that you'd benefit from it, you're welcomed to inherit from them. The protected stuff is there for your use.
If you desire to have a common base for calls, each of the classes implements IList<T>
& ILongList<T>
interfaces. ILongList<T>
is a new interface, made alike IList<T>
, but unlocking full support for long
indexing & use. Considering the characteristics of the classes I feel that the generic name is justified, instead of IRecyclableList<T>
what you could expect.
Protection against incorrect use & state
Having protection against invalid state of the object is an important part of software engineering. The classes included in this project provide the same level of protection as the regular array
& List
class. They will raise exceptions when invalid index is given. But there is no additional protection that the content & state of the objects internally used by these classes are valid. You can break them, if you want to.
This decision is driven by performance measures. Considering wide use of the lists on the hot-paths, each additional check & operation starts counting. I've decided to eliminate all the checks for the sake of performance. If you feel that you need additional protection, feel free to inherit or reuse any of the provided classes to provide that.
this[int index]
, Count
, IndexOf(T item)
etc. may overflow, but...
All the classes included in this package support long
indexing and addressing. The LongCount
property will always provide the total no. of items stored on the list. Since Count
is just LongCount
type-casted to int
, its value may overflow.
But List<T>
addressing is kind of currently limited to int
ranges anyway in practical scenarios. As such Count
overflowing shouldn't be an issue when using the classes with existing code. If you use Count
in your loops, it's recommended to replace it with LongCount
, tough. You don't need to make any changes in foreach
or other iterator loops. They are expected to transparently use the new classes & yield all items, beyond int.MaxValue
range.
This is by design, for the best performance. Initially there have been checks in place to prevent overflowing & return int.MaxValue
instead, but they proved to slow down the operations. Considering low risk for existing code, I've decided to take it out.
If you foresee the overflowing as an issue, you're welcomed to propose an improvement & file a PR with proposed solution.
Auto-properties, new language features & code style
When you look at the code you may note, that it looks like in early c# days. Where the new language features yield the same or better performance, they are & will be used.
However, many of the new language features have proved to provide worse performance at some point in my testing. It includes auto-properties, for which I could see getters & setters generated in IL, instead of fields. Because all the classes are expected to be on the hot-paths, I've eliminated most of them.
Similarly, you will find places in code with hints or warnings disabled. The decision was always driven by performance measures. The use case & business logic was carefully reviewed to ensure it's safe to take shortcuts, before making the change. If you run into failing scenario, please feel free to create a work item against it and/or file a PR to fix it.
Finally, you'll find places with code commented out. That's to easily track what has been tried & what proved to perform worse.
At any time, if you know & can show in your benchmarks that the code can be simplified without impacting the performance, feel free to file a PR with proposed changes. Please include benchmark results from your testing to speed up things.
Thread Safety
The public static
members of the classes are thread safe. Any instance members are not guaranteed to be thread safe. You need to implement locking mechanism to safely use the classes in a multi-threading environment.
Use
All the classes included in this package are meant to be direct replacements of their corresponding system classes.
IList<T>
⇨ILongList<T>
List<T>
⇨RecyclableLongList<T>
Queue<T>
⇨RecyclableQueue<T>
PriorityQueue<T>
⇨RecyclableSortedList<T>
SortedList<T>
⇨RecyclableSortedList<T>
Stack<T>
⇨RecyclableStack<T>
Examples of each of the above classes are provided below. Please refer automated tests for more.
RecyclableList<T>
It's the closest equivalent to List<T>
in regards to functionality. It supports items' removal & delivers the best performance in most scenarios. But it's limited to int
range, tough. If you find that acceptable, my recommendation is to use this class as the direct replacement for List<T>
.
Do note, that it implements ILongList<T>
interface, too. This is to allow having common code base regardless, if you use the basic RecyclableList<T>
or it's long
-range version - RecyclableLongList<T>
. By doing that you can switch between them with highly limited changes in code.
RecyclableLongList<T>
public void RecyclableListExample()
{
int[] testNumbers = new[] { 1, 2, 2, 3 };
// Create
using var list = new RecyclableLongList<int>();
// Add items
foreach (var index in testNumbers)
{
list.Add(index);
}
foreach (var item in list)
{
Console.WriteLine(item);
}
// This will iterate over this[long index]
for (var itemIdx = 0; itemIdx <= list.LongCount; itemIdx++)
{
Console.WriteLine(list[itemIdx]);
}
// This will iterate over this[int index] and may overflow if there are > int.MaxValue no. of items on the list. See: Architecture Decisions
for (var itemIdx = 0; itemIdx <= list.LongCount; itemIdx++)
{
Console.WriteLine(list[itemIdx]);
}
list.Clear();
}
Product | Versions Compatible and additional computed target framework versions. |
---|---|
.NET | net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 is compatible. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed. |
-
net6.0
- Microsoft.Extensions.ObjectPool (>= 7.0.5)
- morelinq (>= 3.4.2)
-
net7.0
- Microsoft.Extensions.ObjectPool (>= 7.0.5)
- morelinq (>= 3.4.2)
NuGet packages (4)
Showing the top 4 NuGet packages that depend on Recyclable.Collections:
Package | Downloads |
---|---|
Recyclable.Collections.TestData
`Recyclable.Collections.TestData` project is a basic open source package for containing test data useful for testing collections like `RecyclableList<T>`, `RecyclableLongList<T>`, `List<T>`, `SortableList<T>`, `PriorityQueue<T>` & similar. |
|
Collections.Benchmarks.Core
`Collections.Benchmarks.Core` project is a basic open source framework for performance measurements based on various collection types like `RecyclableList<T>`, `RecyclableLongList<T>`, `List<T>`, `SortableList<T>`, `PriorityQueue<T>` & similar. |
|
Recyclable.Collections.Compatibility.List
Recyclable.Collections.Compatibility.List project is an open source framework for operating dynamic lists at performance close to raw arrays. It aims at providing minimal memory footprint. It implements IList<T>'s interface and is targeted as direct replacements of List<T>, SortableList<T>, PriorityQueue<T> and similar. |
|
Recyclable.Collections.TestData.xUnit
`Recyclable.Collections.TestData.xUnit` project is a basic open source package for working with repeatable xUnit test data useful for testing collections like `RecyclableList<T>`, `RecyclableLongList<T>`, `List<T>`, `SortableList<T>`, `PriorityQueue<T>` & similar. |
GitHub repositories
This package is not used by any popular GitHub repositories.
Version | Downloads | Last updated | |
---|---|---|---|
0.0.6.2 | 255 | 1/17/2024 | |
0.0.6.1 | 387 | 10/21/2023 | |
0.0.6 | 336 | 10/8/2023 | |
0.0.5 | 202 | 9/10/2023 | |
0.0.4 | 225 | 8/6/2023 | |
0.0.3.4-alpha | 172 | 7/16/2023 | |
0.0.3.3-alpha | 170 | 6/25/2023 | |
0.0.3.2-alpha | 164 | 6/10/2023 | |
0.0.3.1-alpha | 146 | 6/10/2023 | |
0.0.3 | 202 | 6/10/2023 | |
0.0.2 | 336 | 1/3/2023 | |
0.0.1 | 329 | 1/2/2023 |
Various performance improvements, including enumerators.