Taming Big Data within the Corporate Litigation Lifecycle
SpotSigs Robust & Efficient Near Duplicate Detection in Large Web Collections Martin Theobald Jonathan Siddharth Andreas Paepcke Sigir 2008, Singapore.
Near-Duplicate Detection for eRulemaking Hui Yang, Jamie Callan Language Technologies Institute School of Computer Science Carnegie Mellon University.
Near-Duplicate Detection for eRulemaking
SpotSigs Robust & Efficient Near Duplicate Detection in Large Web Collections