Large-scale Near-deduplication Behind BigCode - Tech Sentiments