Amazon product reviews data

This dataset contains product reviews and metadata from Amazon, including 142.8 million reviews spanning May 1996 - July 2014.

This dataset includes reviews (ratings, text, helpfulness votes), product metadata (descriptions, category information, price, brand, and image features), and links (also viewed/also bought graphs).

This dataset is probably preferable for sentiment analysis type tasks.

Link to Dataset

Aggressively deduplicated data (18gb)

No duplicates whatsoever (82.83 million reviews). This file removes duplicates more aggressively, removing duplicates even if they are written by different users. This accounts for users with multiple accounts or plagiarized reviews.

Format is one-review-per-line in (loose) json. See examples below for further help reading the data.

Sample review

[Image: sample review]

where

reviewerID - ID of the reviewer, e.g. A2SUAM1J3GNN3B
asin - ID of the product, e.g. 0000013714
reviewerName - name of the reviewer
helpful - helpfulness rating of the review, e.g. 2/3
reviewText - text of the review
overall - rating of the product
summary - summary of the review
unixReviewTime - time of the review (unix time)
reviewTime - time of the review (raw)
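
Since the file is one review per line in (loose) json, a reader can stream it line by line instead of loading all 18gb at once. The following is a minimal Python sketch, not the dataset authors' own loader. It assumes that "loose" json means a line may be a Python-style dict literal (for example, single-quoted strings), so it tries strict json first and falls back to `ast.literal_eval`. The function names (`parse_review`, `read_reviews`, `review_datetime`), the file path argument, and the sample values other than the two IDs quoted above are hypothetical and used for illustration only.

```python
import ast
import gzip
import json
from datetime import datetime, timezone


def parse_review(line):
    """Parse one line of strict or loose JSON into a dict."""
    try:
        return json.loads(line)
    except json.JSONDecodeError:
        # Assumption: loose lines are Python-style dict literals.
        # literal_eval only accepts literals, so it does not run code.
        return ast.literal_eval(line)


def read_reviews(path):
    """Yield one review dict per non-empty line of the file at `path`.

    Assumption: the file may be gzip-compressed (".gz") or plain text.
    """
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                yield parse_review(line)


def review_datetime(review):
    """Convert the unixReviewTime field to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(review["unixReviewTime"], tz=timezone.utc)


if __name__ == "__main__":
    # Hypothetical sample lines; only the two IDs come from the text above.
    strict_line = '{"reviewerID": "A2SUAM1J3GNN3B", "asin": "0000013714", "overall": 5.0, "unixReviewTime": 1252800000}'
    loose_line = "{'reviewerID': 'A2SUAM1J3GNN3B', 'asin': '0000013714', 'overall': 5.0, 'unixReviewTime': 1252800000}"
    for raw in (strict_line, loose_line):
        review = parse_review(raw)
        print(review["asin"], review["overall"], review_datetime(review).date())
```

Streaming with a generator keeps memory use roughly constant regardless of file size, which matters for a file of this scale. If a line is neither valid json nor a valid Python literal, `ast.literal_eval` raises an exception, so a caller may want to catch that and skip or log the line.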