From 898ac85ad97890d959bdbd8972ab16e7fa278069 Mon Sep 17 00:00:00 2001 From: Rebecca Merrett Date: Mon, 10 Feb 2020 19:07:22 +0000 Subject: [PATCH] Delete Amazon product reviews dataset --- Amazon Product Reviews/Amazon product reviews dataset | 6 ------ 1 file changed, 6 deletions(-) delete mode 100644 Amazon Product Reviews/Amazon product reviews dataset diff --git a/Amazon Product Reviews/Amazon product reviews dataset b/Amazon Product Reviews/Amazon product reviews dataset deleted file mode 100644 index 8b161bf..0000000 --- a/Amazon Product Reviews/Amazon product reviews dataset +++ /dev/null @@ -1,6 +0,0 @@ -## Link to Dataset
-aggressively deduplicated data (18gb) - -No duplicates whatsoever (82.83 million reviews). file removes duplicates more aggressively, removing duplicates even if they are written by different users. This accounts for users with multiple accounts or plagiarized reviews. - -Format is one-review-per-line in (loose) json. See examples below for further help reading the data. \ No newline at end of file -- libgit2 0.26.0