diff --git a/Amazon Product Reviews/Amazon product reviews dataset b/Amazon Product Reviews/Amazon product reviews dataset deleted file mode 100644 index 8b161bf..0000000 --- a/Amazon Product Reviews/Amazon product reviews dataset +++ /dev/null @@ -1,6 +0,0 @@ -## Link to Dataset
-aggressively deduplicated data (18gb) - -No duplicates whatsoever (82.83 million reviews). file removes duplicates more aggressively, removing duplicates even if they are written by different users. This accounts for users with multiple accounts or plagiarized reviews. - -Format is one-review-per-line in (loose) json. See examples below for further help reading the data. \ No newline at end of file