CopyOnWrite 0.2.0

There is a newer version of this package available.
See the version list below for details.
.NET CLI:
dotnet add package CopyOnWrite --version 0.2.0

Package Manager (run this in the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package):
NuGet\Install-Package CopyOnWrite -Version 0.2.0

PackageReference (for projects that support PackageReference, copy this XML node into the project file):
<PackageReference Include="CopyOnWrite" Version="0.2.0" />

Paket CLI:
paket add CopyOnWrite --version 0.2.0

Script and Interactive (the #r directive can be used in F# Interactive and Polyglot Notebooks; copy it into the interactive tool or the source code of the script):
#r "nuget: CopyOnWrite, 0.2.0"

Cake:
// Install CopyOnWrite as a Cake Addin
#addin nuget:?package=CopyOnWrite&version=0.2.0

// Install CopyOnWrite as a Cake Tool
#tool nuget:?package=CopyOnWrite&version=0.2.0

The CopyOnWrite library is a .NET layer over OS-specific logic for copy-on-write linking of files (a.k.a. CoW, file cloning, or reflinking). A CoW link copies a file without actually copying the original file's bytes from one disk location to another. The filesystem ensures that if the original file is modified or deleted, the CoW-linked files remain unmodified, by lazily copying the original file's bytes into each link. Unlike symlinks or hardlinks, writes to a CoW link do not write through to the original file: the filesystem breaks the link and copies the data lazily. This enables scenarios such as file caches, where a single copy of a file held in a content-addressable or other store is safely linked to many locations in a filesystem with low I/O overhead.

This library allows a .NET developer to:

  • Discover whether CoW links are allowed between two filesystem paths,
  • Discover whether CoW links are allowed for a directory tree based at a specific root directory,
  • Create CoW links,
  • Find filesystem CoW link limits.

Discovery is important, as different operating systems and different filesystems available for those operating systems provide varying levels of CoW link support:

  • Windows: The default NTFS filesystem does NOT support CoW, but the ReFS filesystem does.
  • Linux: Btrfs, XFS, and ZFS support CoW, while ext4 does not.
  • Mac: APFS (Apple File System) supports CoW by default.

Because support varies, you may need to wrap this library in a helper that falls back to a regular file copy when CoW is not available.

Example

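using System;
using System.IO;
using Microsoft.CopyOnWrite;

// Placeholder paths for illustration; substitute your own.
string existingFile = Path.Combine(Environment.CurrentDirectory, "source.bin");
string cowLinkFilePath = Path.Combine(Environment.CurrentDirectory, "clone.bin");

ICopyOnWriteFilesystem cow = CopyOnWriteFilesystemFactory.GetInstance();
// Check whether the directory tree under the current directory supports CoW links.
bool canCloneInCurrentDirectory = cow.CopyOnWriteLinkSupportedInDirectoryTree(Environment.CurrentDirectory);
if (canCloneInCurrentDirectory)
{
    cow.CloneFile(existingFile, cowLinkFilePath);
}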
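
The fallback wrapper suggested earlier could be sketched roughly as follows. This is only an illustration, not part of the package: `CloneOrCopy` is a hypothetical helper, and the two delegates stand in for one of the library's discovery checks and for `CloneFile`, so that the sketch runs without a CoW-capable volume. The demo passes stand-ins that report "not supported" to exercise the copy path.

```csharp
using System;
using System.IO;

// Hypothetical helper (not part of the CopyOnWrite package): create a CoW
// link when the caller-supplied check says it is possible, otherwise fall
// back to a regular copy. Returns true when a clone was made.
static bool CloneOrCopy(
    string source,
    string destination,
    Func<string, string, bool> canClone,   // stand-in for a library discovery check
    Action<string, string> cloneFile)      // stand-in for cow.CloneFile
{
    if (canClone(source, destination))
    {
        cloneFile(source, destination);
        return true;
    }

    // Assumption for this sketch: overwrite an existing destination.
    File.Copy(source, destination, overwrite: true);
    return false;
}

// Demo on a scratch directory, with stand-ins that force the fallback path.
string dir = Path.Combine(Path.GetTempPath(), "cow-demo-" + Guid.NewGuid().ToString("N"));
Directory.CreateDirectory(dir);
string src = Path.Combine(dir, "source.txt");
string dst = Path.Combine(dir, "copy.txt");
File.WriteAllText(src, "hello");

bool cloned = CloneOrCopy(
    src,
    dst,
    (s, d) => false,
    (s, d) => throw new InvalidOperationException("clone should not be attempted"));

Console.WriteLine($"cloned={cloned}, content={File.ReadAllText(dst)}");
```

In real use the delegates would presumably wrap the calls shown in the example above, for instance `(s, d) => cow.CloneFile(s, d)` for the clone step. Caching the result of the discovery check per volume is a design choice left to the caller.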

Windows: Workarounds for ReFS parallel cloning bug

As of September 2022, on Windows Server 2022 and Windows 10 and 11, cloning a single recently written source file in parallel can sometimes produce a cloned file whose content is all zeroes. This is caused by a bug in ReFS; see https://github.com/microsoft/CopyOnWrite/issues/12 for more details. This library provides workarounds for the problem, or you can change how source files are written so that they are flushed to disk before any parallel clones are performed. The workarounds include:

  • Serialize cloning system-wide per source path. You could accomplish this by ensuring only a single thread can clone a single source file at a time. This library takes this approach by default for single-process cloning by using an in-memory dictionary. You can opt into system-wide serialization using kernel mutexes by specifying useCrossProcessLocksWhereApplicable = true when calling CopyOnWriteFilesystemFactory.GetInstance().
  • Ensure the source file is completely flushed to disk before cloning. This can be accomplished through one of the approaches below. Note that if you use these approaches, you can increase cloning performance by using CloneFlags.NoSerializedCloning on your CloneFile calls.
    • Use the FlushFileBuffers API to force the file to be flushed from memory. Call it at the end of writing the source file to disk, while the file write handle is still open. Alternatively, call it on a new handle to the file opened with GENERIC_WRITE.
    • When writing the source file, open the file handle with FILE_FLAG_NO_BUFFERING. Note that this requires the code writing the file to write chunks that are aligned with the sector size of the underlying volume and whose size is a multiple of the sector size.
    • When writing the source file, open the file handle with FILE_FLAG_WRITE_THROUGH. This forces a flush on every write, which can decrease performance significantly.
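
From .NET, the first flushing approach above can be sketched without P/Invoke: `FileStream.Flush(flushToDisk: true)` asks the operating system to flush its buffers for the file, which on Windows is implemented with FlushFileBuffers. The helper name `WriteAndFlushToDisk` and the temp-file path are assumptions made for this illustration.

```csharp
using System;
using System.IO;
using System.Text;

// Hypothetical helper: write a source file and force its contents to disk
// while the write handle is still open, before anything tries to clone it.
static void WriteAndFlushToDisk(string path, byte[] content)
{
    using var stream = new FileStream(path, FileMode.Create, FileAccess.Write, FileShare.None);
    stream.Write(content, 0, content.Length);

    // Flush(true) flushes the stream's own buffer and also asks the OS to
    // flush its intermediate file buffers to the underlying device.
    stream.Flush(flushToDisk: true);
}

string sourcePath = Path.Combine(
    Path.GetTempPath(),
    "cow-flush-demo-" + Guid.NewGuid().ToString("N") + ".bin");

WriteAndFlushToDisk(sourcePath, Encoding.UTF8.GetBytes("payload written before cloning"));
Console.WriteLine($"Wrote and flushed {new FileInfo(sourcePath).Length} bytes to {sourcePath}");
```

Once every writer of source files flushes this way, the NoSerializedCloning flag mentioned above becomes an option for the subsequent clone calls. Flushing once per file is generally cheaper than opening the handle in write-through mode, which flushes on every write.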

Release History

NuGet package:

  • 0.2.0 September 2022: Improve documentation for ReFS parallel cloning bug workarounds. Improve Windows cloning performance by 7.2% by using sparse destination files. Change default behavior to leave the destination file sparse and replace CloneFlags.NoSparseFileCheck with DestinationMustMatchSourceSparseness, hence the minor version increase.
  • 0.1.13 September 2022: Fix CloneFlags to use individual bits.
  • 0.1.12 September 2022: Add new factory flag that sets a mode to require cross-process Windows mutexes for safe source file locking to avoid a ReFS concurrency bug. Add optimization to allow bypassing redundant Path.GetFullPath() when caller has done it already.
  • 0.1.11 September 2022: Serialize Windows cloning on source path to work around ReFS limitation in multithreaded cloning.
  • 0.1.10 September 2022: Fix missing destination file failure detection.
  • 0.1.9 September 2022: Add explicit cache invalidation call to interface. Update Windows implementation to detect ReFS mount points that are not drive roots, e.g. mounting D:\ (ReFS volume) under C:\ReFS.
  • 0.1.8 April 2022: Add overload for CoW clone to allow bypassing some Windows filesystem feature checks.
  • 0.1.7 April 2022: Perf improvement for Windows CoW link creation by reducing kernel round-trips.
  • 0.1.6 April 2022: Perf improvement for all Windows APIs.
  • 0.1.5 October 2021: Separate exception type for when link limit is exceeded. Mac and Linux throw NotSupportedException.
  • 0.1.4 October 2021: Fix doc XML naming. Mac and Linux throw NotSupportedException.
  • 0.1.3 October 2021: Bug fixes for Windows. Mac and Linux throw NotSupportedException.
  • 0.1.2 October 2021: Performance fixes for Windows. Mac and Linux throw NotSupportedException.
  • 0.1.1 October 2021: Bug fixes for Windows. Mac and Linux throw NotSupportedException.
  • 0.1.0 July 2021: Windows ReFS support. Mac and Linux throw NotSupportedException.

Contributing

This project welcomes contributions and suggestions. See CONTRIBUTING.md.

Running Unit Tests on Windows

If you have a local ReFS drive volume on which to run ReFS related tests, set the following user or system level environment variable:

CoW_Test_ReFS_Drive=D:\

(You may need to exit and restart VS, VSCode, or consoles after setting this.) When this environment variable is not set, unit tests create and mount a local ReFS VHD for testing. You must run tests elevated (as admin), e.g. by opening Visual Studio as an admin before opening the solution.

Performance Comparisons

Windows

CoW links on ReFS take approximately constant time, so they save time over file copies at every file size except zero. The savings grow with file size: 16MB files clone about 35X faster, 1MB files about 3.2X, and small files about 1.3X.

The detailed numbers below were measured on a VHD formatted empty with ReFS for each iteration, comparing System.IO.File.Copy() with CloneFile(). Each measurement makes 50 copies/clones of a single source file, using the highest-performance clone flags and settings. See CoWComparisons.cs. The machine was an 8/16-core, NVMe, Win11 22H1 Enterprise.

Method | FileSize (bytes) | Mean | Error | StdDev | Median | Ratio | RatioSD
File.Copy | 0 | 202.8 us | 6.79 us | 19.92 us | 199.5 us | 1.00 | 0.00
CoW | 0 | 192.8 us | 5.40 us | 15.57 us | 191.2 us | 0.96 | 0.12
File.Copy | 1 | 346.8 us | 9.65 us | 27.99 us | 345.8 us | 1.00 | 0.00
CoW | 1 | 237.5 us | 7.38 us | 21.28 us | 239.3 us | 0.69 | 0.08
File.Copy | 1024 | 372.9 us | 9.75 us | 27.66 us | 371.2 us | 1.00 | 0.00
CoW | 1024 | 248.0 us | 6.97 us | 20.21 us | 248.5 us | 0.67 | 0.07
File.Copy | 16384 | 347.1 us | 12.56 us | 35.62 us | 342.2 us | 1.00 | 0.00
CoW | 16384 | 248.8 us | 9.76 us | 28.47 us | 244.3 us | 0.72 | 0.11
File.Copy | 262144 | 484.3 us | 11.83 us | 34.69 us | 482.4 us | 1.00 | 0.00
CoW | 262144 | 247.8 us | 8.51 us | 24.69 us | 245.6 us | 0.51 | 0.06
File.Copy | 1048576 | 954.4 us | 19.06 us | 41.85 us | 948.0 us | 1.00 | 0.00
CoW | 1048576 | 251.3 us | 9.77 us | 28.64 us | 246.8 us | 0.27 | 0.03
File.Copy | 16777216 | 9,867.2 us | 195.87 us | 546.01 us | 9,605.5 us | 1.00 | 0.00
CoW | 16777216 | 283.5 us | 6.56 us | 18.60 us | 282.7 us | 0.03 | 0.00

Same benchmark performed on a ReFS partition (no VHD) on the NVMe disk:

Method | FileSize (bytes) | Mean | Error | StdDev | Median | Ratio | RatioSD
File.Copy | 0 | 205.5 us | 8.82 us | 26.01 us | 200.1 us | 1.00 | 0.00
CoW | 0 | 183.7 us | 4.85 us | 14.16 us | 181.9 us | 0.91 | 0.13
File.Copy | 1 | 307.9 us | 6.69 us | 19.53 us | 306.7 us | 1.00 | 0.00
CoW | 1 | 265.5 us | 15.18 us | 44.77 us | 265.6 us | 0.87 | 0.15
File.Copy | 1024 | 310.3 us | 6.56 us | 18.82 us | 310.4 us | 1.00 | 0.00
CoW | 1024 | 235.1 us | 9.51 us | 26.82 us | 227.9 us | 0.76 | 0.10
File.Copy | 16384 | 295.5 us | 5.80 us | 15.39 us | 295.2 us | 1.00 | 0.00
CoW | 16384 | 272.4 us | 16.46 us | 46.96 us | 275.9 us | 0.95 | 0.16
File.Copy | 262144 | 425.5 us | 9.29 us | 26.80 us | 423.8 us | 1.00 | 0.00
CoW | 262144 | 234.6 us | 7.83 us | 22.96 us | 232.7 us | 0.55 | 0.06
File.Copy | 1048576 | 851.7 us | 20.00 us | 56.08 us | 836.3 us | 1.00 | 0.00
CoW | 1048576 | 273.7 us | 15.39 us | 45.39 us | 269.9 us | 0.33 | 0.06
File.Copy | 16777216 | 9,327.9 us | 179.64 us | 206.88 us | 9,282.0 us | 1.00 | 0.00
CoW | 16777216 | 273.3 us | 15.22 us | 44.17 us | 260.5 us | 0.04 | 0.00

Compatible and additional computed target framework versions, by product:

.NET: net5.0 was computed. net5.0-windows was computed. net6.0 is compatible. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed.
.NET Core: netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed.
.NET Standard: netstandard2.0 is compatible. netstandard2.1 was computed.
.NET Framework: net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.
MonoAndroid: monoandroid was computed.
MonoMac: monomac was computed.
MonoTouch: monotouch was computed.
Tizen: tizen40 was computed. tizen60 was computed.
Xamarin.iOS: xamarinios was computed.
Xamarin.Mac: xamarinmac was computed.
Xamarin.TVOS: xamarintvos was computed.
Xamarin.WatchOS: xamarinwatchos was computed.

Dependencies

  • .NETStandard 2.0: No dependencies.
  • net6.0: No dependencies.

NuGet packages

This package is not used by any NuGet packages.

GitHub repositories (1)

Showing the top 1 popular GitHub repositories that depend on CopyOnWrite:

  • microsoft/MSBuildSdks: MSBuild project SDKs

Version Downloads Last updated
0.3.8 3,162 3/25/2024
0.3.7 8,020 9/11/2023
0.3.6 7,806 7/17/2023
0.3.5 162 7/13/2023
0.3.4 172 7/12/2023
0.3.3 165 7/11/2023
0.3.2 8,957 2/9/2023
0.3.1 299 2/8/2023
0.3.0 1,054 1/30/2023
0.2.2 365 1/27/2023
0.2.1 5,868 9/27/2022
0.2.0 443 9/15/2022
0.1.13 434 9/15/2022
0.1.12 447 9/15/2022
0.1.11 445 9/6/2022
0.1.10 403 9/3/2022
0.1.9 380 9/3/2022
0.1.8 487 4/19/2022
0.1.7 466 4/19/2022
0.1.6 417 4/19/2022
0.1.5 350 10/15/2021
0.1.4 337 10/14/2021
0.1.3 329 10/13/2021
0.1.2 319 10/13/2021
0.1.1 363 10/11/2021
0.1.0 378 7/10/2021