Persephone
Git-inspired distributed version control system built from scratch — handling repositories with 10,000+ files and 50+ concurrent operations.
Content-addressable object store using SHA-256 with data deduplication. Parallel diff computation via goroutines and buffered channels. Modular CLI with 15+ commands.
Key Engineering
- Content-addressable storage achieving 40% storage reduction through deduplication across 1,000+ commits
- Three-way merge engine processing 500+ file merges with 95% automatic conflict resolution
- Reduced diff computation time by 65% by parallelizing comparisons using 8 goroutines and buffered channels
- Modular CLI architecture with 15+ commands supporting repositories up to 5GB
Challenges Solved
- Deduplicating unchanged files across snapshots without full-tree traversal
- Three-way merge conflict detection with accurate line-level diffs
- Efficient streaming compression for large repositories
Demonstrates deep understanding of Git internals, file system design, and concurrent programming — building the tools, not just using them.