GKZipLib NuGet Package

GKZipLib was written for fast parsing of ZIP archives generated by GrayKey in .NET.

Publicly available parsing libraries I tried in C# were either too slow at parsing large ZIPs or completely failed when attempting to parse GK zips (or both). Developing this library was a fantastic exercise that really enhanced my own personal understanding of how zip files work.

One of the big focuses of this library is being as fast as possible. Let's keep in mind GK zips can go from 5-10 GB to hundreds of GB in size. So how do we keep it fast? GKZipLib accomplishes this by parsing ONLY as much as it needs to, unless a file is identified (by path, etc) as needing to be extracted.

First, it parses the entire central directory into RAM.

The CD is typically quite small so this is doable. On a file-by-file basis, you can then decide whether or not to load additional details such as the data's absolute offset within the file, the file's compressed/uncompressed size, and so on.

Probably the most potent usage of this is what I'm going to call "LINQ to GKZip" -- taking advantage of the fact that the library implements IEnumerable and thus can be called with a simple foreach. Please see Example.cs for the simplest usage.

Contact the author on Discord - forensicmike#6426 or Twitter DM @forensicmike1.




Got any GKZipLib Question?





Info

Version: 1.0.1
Author(s): Mike Williamson (@forensicmike1)
Last Update: Wednesday, March 4, 2020
.NET Fiddle: Create the first Fiddle
NuGet Url: https://www.nuget.org/packages/GKZipLib


Install
Install-Package GKZipLib
dotnet add package GKZipLib
paket add GKZipLib
GKZipLib Download (Unzip the "nupkg" after downloading)



Tags



STATS

must-have-score

avg-downloads-per-day

0

days-since-last-release

1725