Deduplicator uses size comparison and fxhash (a fast, non-cryptographic hashing algorithm) to quickly scan through large numbers of files and find duplicates. It is also highly parallel, built on rayon and dashmap. I was able to scan through 120GB of files (videos, PDFs, images) in ~300ms. Check out the benchmarks below.

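The two-pass idea described above (compare sizes first, hash only the files that share a size) can be sketched roughly as follows. This is not deduplicator's actual code: the real tool uses fxhash and parallelizes with rayon and dashmap, while this dependency-free sketch substitutes std's `DefaultHasher` and runs sequentially.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashMap;
use std::fs;
use std::hash::{Hash, Hasher};
use std::io;
use std::path::PathBuf;

// Sketch of the size-then-hash strategy. Files with a unique size cannot
// have a duplicate, so only files sharing a size are read and hashed.
fn find_duplicates(paths: &[PathBuf]) -> io::Result<Vec<Vec<PathBuf>>> {
    // Pass 1: group candidate files by size.
    let mut by_size: HashMap<u64, Vec<&PathBuf>> = HashMap::new();
    for p in paths {
        by_size.entry(fs::metadata(p)?.len()).or_default().push(p);
    }

    // Pass 2: hash contents, but only within size groups of 2+ files.
    let mut by_hash: HashMap<u64, Vec<PathBuf>> = HashMap::new();
    for group in by_size.values().filter(|g| g.len() > 1) {
        for p in group {
            let mut hasher = DefaultHasher::new();
            fs::read(p)?.hash(&mut hasher);
            by_hash.entry(hasher.finish()).or_default().push((*p).clone());
        }
    }

    // Any hash bucket with 2+ entries is a duplicate group.
    Ok(by_hash.into_values().filter(|g| g.len() > 1).collect())
}

fn main() -> io::Result<()> {
    // Demo on three small temp files: two identical, one different.
    let dir = std::env::temp_dir().join("dedup_sketch_demo");
    fs::create_dir_all(&dir)?;
    fs::write(dir.join("a.txt"), b"same content")?;
    fs::write(dir.join("b.txt"), b"same content")?;
    fs::write(dir.join("c.txt"), b"different")?;
    let files = vec![dir.join("a.txt"), dir.join("b.txt"), dir.join("c.txt")];
    let groups = find_duplicates(&files)?;
    println!("{} duplicate group(s)", groups.len());
    Ok(())
}
```

Skipping the hash pass for files with unique sizes is what keeps scans fast even on large directories: only a small fraction of files typically need their contents read at all.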
## benchmarks
| Command | Dir Size | Mean [ms] | Min [ms] | Max [ms] | Relative |
* The last entry is slower because of the sheer number of files deduplicator had to scan (~660,895 files). The average size of the files rarely affects deduplicator's performance.
These benchmarks were run using [hyperfine](https://github.com/sharkdp/hyperfine). Here are the specs of the machine used to benchmark deduplicator: