Projects STRLCPY deduplicator Commits c6318a31
    README.md
    skipped 66 lines
    67 67   
    68 68  ## Performance
    69 69   
    70  -Deduplicator uses size comparison and fxhash (a non-cryptographic hashing algorithm) to quickly scan through a large number of files to find duplicates. It is also highly parallel (uses rayon and dashmap). I haven't uploaded the benchmarks yet, but I was able to scan through 120GB of files (videos, PDFs, images) in ~300ms.
     70 +Deduplicator uses size comparison and fxhash (a non-cryptographic hashing algorithm) to quickly scan through a large number of files to find duplicates. It is also highly parallel (uses rayon and dashmap). I was able to scan through 120GB of files (videos, PDFs, images) in ~300ms. Check out the benchmarks below.
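The size-then-hash idea described in this paragraph can be sketched roughly as follows. This is a minimal illustration, not deduplicator's actual code: `find_duplicates` is a hypothetical helper, it only looks at the files directly inside one directory, and it swaps in the standard library's `DefaultHasher` and `HashMap` for fxhash, rayon, and dashmap so that the example has no external dependencies.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::fs;
use std::hash::Hasher;
use std::io;
use std::path::{Path, PathBuf};

/// Hypothetical helper (an assumption for illustration): returns groups of
/// regular files directly inside `dir` that share both a size and a content hash.
fn find_duplicates(dir: &Path) -> io::Result<Vec<Vec<PathBuf>>> {
    // Pass 1: bucket files by size. This reads metadata only, never contents.
    let mut by_size: HashMap<u64, Vec<PathBuf>> = HashMap::new();
    for entry in fs::read_dir(dir)? {
        let entry = entry?;
        let meta = entry.metadata()?;
        if meta.is_file() {
            by_size.entry(meta.len()).or_default().push(entry.path());
        }
    }

    // Pass 2: hash contents only for files whose size collides with another file.
    let mut groups = Vec::new();
    for paths in by_size.into_values() {
        if paths.len() < 2 {
            continue; // a unique size cannot have a duplicate
        }
        let mut by_hash: HashMap<u64, Vec<PathBuf>> = HashMap::new();
        for path in paths {
            // DefaultHasher stands in for fxhash here (assumption, std-only sketch).
            let mut hasher = DefaultHasher::new();
            hasher.write(&fs::read(&path)?);
            by_hash.entry(hasher.finish()).or_default().push(path);
        }
        groups.extend(by_hash.into_values().filter(|g| g.len() > 1));
    }
    Ok(groups)
}
```

Because most files have a unique size, the first pass discards them without reading any data, which is likely why total directory size matters less than file count. Matching hashes from a non-cryptographic hasher indicate probable duplicates only; a byte-for-byte comparison would be needed for certainty.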
     71 + 
     72 +## Benchmarks
     73 + 
     74 +| Command | Dirsize | Mean [ms] | Min [ms] | Max [ms] | Relative |
     75 +|:---|:---|---:|---:|---:|---:|
     76 +| `deduplicator --dir ~/Data/tmp` | (~120G) | 27.5 ± 1.0 | 26.0 | 32.1 | 1.70 ± 0.09 |
     77 +| `deduplicator --dir ~/Data/books` | (~8.6G) | 21.8 ± 0.7 | 20.5 | 24.4 | 1.35 ± 0.07 |
     78 +| `deduplicator --dir ~/Data/books --minsize 10M` | (~8.6G) | 16.1 ± 0.6 | 14.9 | 18.8 | 1.00 |
     79 +| `deduplicator --dir ~/Data/ --types pdf,jpg,png,jpeg` | (~290G) | 1857.4 ± 24.5 | 1817.0 | 1895.5 | 115.07 ± 4.64 |
     80 + 
     81 +* The last entry is slower because of the number of files deduplicator had to go through (~660,895 files). The average file size rarely affects deduplicator's performance.
     82 + 
     83 +These benchmarks were run using [hyperfine](https://github.com/sharkdp/hyperfine). Here are the specs of the machine used to benchmark deduplicator:
     84 + 
     85 +```
     86 +OS: Arch Linux x86_64
     87 +Host: Precision 5540
     88 +Kernel: 5.15.89-1-lts
     89 +Uptime: 4 hours, 44 mins
     90 +Shell: zsh 5.9
     91 +Terminal: kitty
     92 +CPU: Intel i9-9880H (16) @ 4.800GHz
     93 +GPU: NVIDIA Quadro T2000 Mobile / Max-Q
     94 +GPU: Intel CoffeeLake-H GT2 [UHD Graphics 630]
     95 +Memory: 31731MiB (~32GiB)
     96 +```
    71 97   
    72 98  ## Screenshots
    73 99   
    skipped 2 lines