Commit 2722d78a by Jonathan Kelly

Switchup scraping update

Code finished without comments or cleanup
parent 8746b005
{}
\ No newline at end of file
...@@ -18,13 +18,13 @@ ...@@ -18,13 +18,13 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 2, "execution_count": 4,
"source": [ "source": [
"import requests\r\n", "import requests\r\n",
"from bs4 import BeautifulSoup\r\n", "from bs4 import BeautifulSoup\r\n",
"\r\n", "\r\n",
"\r\n", "\r\n",
"URL = \"https://www.switchup.org/bootcamps/data-science-dojo?page=1\"\r\n", "URL = \"https://www.switchup.org/bootcamps/data-science-dojo?page=\"\r\n",
"page = requests.get(URL)\r\n", "page = requests.get(URL)\r\n",
"soup = BeautifulSoup(page.content, \"html.parser\")" "soup = BeautifulSoup(page.content, \"html.parser\")"
], ],
...@@ -40,7 +40,7 @@ ...@@ -40,7 +40,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": null, "execution_count": 5,
"source": [ "source": [
"n=1\r\n", "n=1\r\n",
"while True:\r\n", "while True:\r\n",
...@@ -56,7 +56,15 @@ ...@@ -56,7 +56,15 @@
"main_page_url_list = [URL + str(i) for i in range(i,n)]\r\n", "main_page_url_list = [URL + str(i) for i in range(i,n)]\r\n",
"print(main_page_url_list) # check" "print(main_page_url_list) # check"
], ],
"outputs": [], "outputs": [
{
"output_type": "stream",
"name": "stdout",
"text": [
"['https://www.switchup.org/bootcamps/data-science-dojo?page=1', 'https://www.switchup.org/bootcamps/data-science-dojo?page=2', 'https://www.switchup.org/bootcamps/data-science-dojo?page=3', 'https://www.switchup.org/bootcamps/data-science-dojo?page=4', 'https://www.switchup.org/bootcamps/data-science-dojo?page=5', 'https://www.switchup.org/bootcamps/data-science-dojo?page=6', 'https://www.switchup.org/bootcamps/data-science-dojo?page=7', 'https://www.switchup.org/bootcamps/data-science-dojo?page=8', 'https://www.switchup.org/bootcamps/data-science-dojo?page=9', 'https://www.switchup.org/bootcamps/data-science-dojo?page=10', 'https://www.switchup.org/bootcamps/data-science-dojo?page=11', 'https://www.switchup.org/bootcamps/data-science-dojo?page=12']\n"
]
}
],
"metadata": {} "metadata": {}
}, },
{ {
...@@ -65,7 +73,7 @@ ...@@ -65,7 +73,7 @@
"source": [ "source": [
"# for x, _ in enumerate(range(1,n)):\r\n", "# for x, _ in enumerate(range(1,n)):\r\n",
"# url = main_page_url_list[]\r\n", "# url = main_page_url_list[]\r\n",
"page = requests.get('https://www.switchup.org/bootcamps/data-science-dojo?page=2')\r\n", "page = requests.get(url)\r\n",
"soup = BeautifulSoup(page.content, \"html.parser\")\r\n", "soup = BeautifulSoup(page.content, \"html.parser\")\r\n",
"pretty_soup = soup.prettify()\r\n", "pretty_soup = soup.prettify()\r\n",
"with open(\"Review page \" + str(2) + \".txt\", \"w\", encoding=\"utf-8\") as f:\r\n", "with open(\"Review page \" + str(2) + \".txt\", \"w\", encoding=\"utf-8\") as f:\r\n",
......
Name,Job,Date of Review,Review,Overall Rating,Curriculum Rating,Job Support Rating
Jacqueline,Marketing Consultant,7/14/2021,I really enjoyed the experience overall. The instructors were thorough and attentive in providing support for all learners whether new or experienced. I feel confident that I can build on the knowledge and skills I've gained to become a data science practitioner. ,100,100,NA
Jonathan,Data Science Intern,7/13/2021,"I was making a career change and unsure about what course/bootcamp to take. After doing research, I decided Data Science Dojo was the best fit for me. It was such an awesome experience and I have learned a TON! The instructors were knowledgeable and were able to keep things easy to understand while still providing a challenging environment to grow. They never balked at anyone or made them feel small and I was never afraid to ask a question. Office hours were a great time to get 1-on-1 tutoring too. I feel prepared and ready for the next step in my career. I would definitely recommend.",100,100,100
Kevin Lee,N/A,5/12/2021,"This bootcamp is for people who want to get their feet wet in the data science field. The instructors were amazing, the platform was amazing, and the resources they had were amazing. The way the instructors broke down the core data science concept was very easy to understand.",100,100,NA
Jeffrey Bierman,Graduate Student in Chemical Biology,4/22/2021,"Want a solid overview of common Python fundamentals for data analysis? Want down-to-earth instructors that are actually interested in their work, and you? Want to know how to evaluate data, clean it, visualize it, polish it, and feed it into a pipeline for machine learning? Want all of this in a five-day course? Look no further. I didn't know lists from dictionaries before this, and now I have a lot of essential tools at my disposal to do bioinformatic analysis for my graduate research. Not only that, but all of the course material is available to me for an additional six months after the course ended, so I consider more than five days of instruction. Plus, the instructors make sure they are available to help you and set you up for success. To add, I really liked the style of this course, and I think it is so pragmatic for learning programming. I love that our live sessions were recorded so that we can go back and see exactly what we saw before. I think my favorite aspect were the instructors, because they were very kind, knowledgeable, and made everything less intimidating. I love that the instructors were all around the world too! Such a cool way to showcase the wonders of working online and utilizing computers to learn about computers. I will hopefully be in contact with Data Science Dojo again in the future! Thank You!",100,100,100
Sarah,Product Manager,2/19/2021,"The online data science bootcamp offered by Data Science Dojo was a really valuable investment. Over a course of approximately 4 months, we had weekly 3-hour-long live classes with optional homework to deepen what we had learnt. From building and evaluating Machine Learning Models, to parameter tuning to text analytics and linear regression (and much more) - many relevant topics were covered and explained with such patience and good examples that even someone like me from a more non-technical/non-mathematical background was able to understand and follow along. The instructors were empathetic and extremely knowledgable and welcomed students' questions at any time throughout the course. The homework was appropriate and voluntary - the suggested reading material was interesting and contributed to a better understanding of certain topics.",100,100,NA
Anonymous,Evaluator,2/16/2021,"The boot camp provided me an overview of all key machine learning tools in a practical way that I could apply them in my work, and grow my expertise. I would highly recommend it. It allows you to think about the use case for the approaches, and when and where to most effectively use them.",100,100,100
Anonymous,N/A,2/16/2021,"This course has something for everyone! I especially was happy to learn more about R and Azure. If you are looking for help with text analytics, predictive modeling, logistic regression, they teach on these topics and more. The instructors are there to answer questions and to go in-depth during class and office hours.",100,100,100
Sumit Hore,Phd student in Knowledge Management,12/9/2020,"I did a lot of research on data science boot camps before I finally choose the Data Science Dojo. I joined the boot camp with a business and economics background to enhance my understanding of data science and more specifically on predictive modelling and machine learning.The classes were intensive, interactive, full of practical examples, exercises, and labs. The tutors and mentors were extremely competent, friendly, and available to provide one-to-one help to solve the class assignments and projects. This boot camp also allows the participants to earn 7 credit points from the University of New Mexico, USA.After attending the boot camp, I strongly believe that it can be a great start for anyone who loves data and to build their career in data science.",100,100,100
Ryan Eaton,Researcher,12/5/2020,"I came to the bootcamp with a background in scientific data analysis hoping to broaden my understanding of predictive models and machine learning. Data Science Dojo surpassed my expectations through combination of in-person, discussion-oriented classes and practical, hands-on exercises. Modules on feature engineering and cross-validation techniques I found especially useful. From day one it was obvious that emphasis would be on algorithms and rationale underlying the predictive models covered over intricacies of the particular functions/libraries used to generate them. Though I had no trouble figuring out how to code to train and test models after working through the well-documented detailed exercises. All-in-all, Data Science Dojo is a strong choice for the student wanting to understand machine learning fundamentals.",86,100,80
Anonymous,N/A,12/3/2020,"I'm really converted. I never thought a data science bootcamp course could bring such great value to me but here I am converted let ring you know the DoJo bootcamp is the best data science course I recommend strongly. The teaching is excellent and class discussion is really interesting. The school also arranged the course time fitting my timezone which was amazing. If anyone wishes a strong foundation and basis of data science, I strongly recommend this course. It gives you confidence and makes you feel you grasped the important concepts. For me, it gets five stars.",100,100,NA
S. Dali,Data Scientist,12/3/2020,"This is a boot camp for Data Science: if this means for you short, intensive and rigorous training, you're in the right place.I think this is for any one who doesn't have a formal training in Machine Learning and wants to get familiar with the field, the methods and concepts used. I think this is a good place to start and expand by practicing. I also think Team Managers who want to know more can also benefit from this curriculum.The classes where very friendly and full of practical examples. Best of all, you still have access to the material after the boot camp.I would strongly recommend it to anyone. It is definitely money and time well spent.",94,80,100
Javier,Economist,11/18/2020,"This online course was excellent. The main professor is, not only an excellent instructor, but a super approachable person. He even add classes to complete the program since we got delay because we asked so many questions. He is totally patient. After that, I wanted to check another bootcamp, which I am doing right now in a very prestigious university and it is not half as good as Data Science Dojo. The fact that there are not many students (we were something like 15), gave the professor the chance to go over all the questions. Overall, 5 out of 5, no doubt about it.",100,100,NA
Anastasia,Portfolio Manager,10/22/2020,This course not only gives a great overview of the variety of data science methods but also comes with a lot of practical experience for example through an in-class Kaggle competition. For the practical exercises the code is provided in R as well as in Python which I found very helpful. The bootcamp also focuses on how to apply data science in order to actually support the business.,100,100,100
Ottmar,N/A,10/10/2020,"The comprehensive curriculum, together with their unique learning platform and tools, provided us with an impressive balance of required theory, engaging practical exercises and tons of supplemental material. If that wasn’t enough, you get online office hours to get LIVE support for anything needed on the course. After each class, you get an email summary of the topics discussed, additional optional exercises, and can also access the recording of previous sessions if you wish to go back and review it.Raja, despite being the CEO and having a strong technical team supporting during each LIVE session, takes the teaching into his own hands to prove that this is not a mere business, but his passion. He has a natural gift to immerse you into the topics with simplicity, depth, and hands-on practice skillfully, while continuously encouraging us to think beyond the basics or what is seemingly obvious. While he could easily fast forward to the hands-on and call it a day, he ensures that we all understand the concepts behind the practical application, in what end up being three intense hours of LIVE learning each week.An EXCELLENT learning experience that is highly recommended!",100,100,100
Gabriel Buehler,Sr Account Executive,9/22/2020,"We were half when through 2020 when our cohort started. This was the first time DSD hosted an online learning bootcamp. At first, I was disappointed as I'm a very hands-on tactical learner. I was also completing my BS in BusAdmin online at the time, so I had a great comparison to the online learning environment created by a large university. Additionally, I had spent a year or so trying to learn what I could on the Courseras and other online learning platforms. For starters, Raja made the entire experience engaging and fun even at times when class enthusiasm wasn't especially high. Every class was valuable and packed with ML theory and best practices. It provided a good foundation in coding for ML, an incredible deep dive into terms and methodologies, explored the math and logic behind the integral statistics for ML, and continuing resources that will ensure my success moving forward. This DID NOT feel like their first run at an online bootcamp. This was comprehensive and engaging. When things clear up from COVID I will likely be trying to attend the in-person bootcamp as well. You will be required to participate. You will learn data visualization and cleansing methods (the most important part). You will build and tune models with the expected understanding of the algorithms you're using and parameters you're tuning and why. You will use common industry practices and MANY software platforms to develop your business problem solving stack. You'll come away with enough new information to realize that you've barely scratched the surface in becoming a data scientist. You'll be one step closer to know what you don't know, but you'll also be given the resources to find the answers.With that said, I've said to much. I hope my appreciation for this course comes through. If you get a chance to attend Datascience Dojo's online or in-person bootcamp, you should.",86,100,60
Anonymous,N/A,3/11/2020,"I attended the 3-day Hands on Introduction to Data Science. Raja is an amazing instructor, and Arham is kind and patient. These two make a great team. Using Azure Machine Learning tools was a great way to build models and deploy them. The material that was covered was fairly comprehensive and I personally learned more than I thought possible. I do feel that the material is not meant for a 3-day course, so at times it did not feel like a smooth transition from one topic to another. Also, this course is most appropriate for those that have current data science/programming knowledge-it was not truly meant for beginners. This course is a great introduction to the topic, and I highly recommend it.",100,100,100
Anonymous,N/A,2/27/2020,"This class will stretch your mind in ways you never thought possible. Although actually, after taking this class, it is possible to articulate the ways and then create a predictive model for it! The real world examples help significantly apply data science techniques. Classes are very engaging and there is technical support on hand at all times. I highly recommend this class if you are an inspiring data scientist, data engineer, Database admin, or just need some inspiration in your current data role.",100,100,100
Steve Rusev,N/A,2/12/2020,"I attended the 5 day Data Science & Data Engineering Bootcamp in Austin, TX and want to say that it was great. The material that we covered was comprehensive and gave you deep understanding of the fundamentals of machine learning and data science. The practical workshops were easy to setup and execute for person without any programming experience and if you are experienced developer you can use them as a building blocks to construct much complex models. I highly recommend Data Science Dojo to anyone who wants to learn machine learning.",100,100,100
Miles Pomeroy,N/A,2/7/2020,So much information! Attending this bootcamp is like having a semester/year long college level class in Data Science all in one week. The bootcamp focuses on theory over tools establishing the fundamentals needed to then go forth and self-learn the tools needed for your specific applications.,100,100,100
Yulia Dorofeyuk,N/A,1/16/2020,"A brilliant experience with a super intensive study from early morning till late hours. An outstanding lecturer, extremely useful knowledges and co-working in our great international group (i guess there were only a few country matches from the whole group). Very inspiring week! As a general result I would mention lots of insights even for a well known models, great new people to get acquainted with and for sure motivation 80 lvl for the further improvement! P.s. Students still have access to a learning portal after the bootcamp, that contains everything we had during bootcamp as well as post-bootcamp tutorials and content!",94,100,80
Omar Al,N/A,1/13/2020,"The program was intense and extremely useful for beginners but also for experienced data scientist or engineers from different background. The Pros were as follow : - Brief theoretical Presentations- Extensive Practical Examples - Mentoring during practical examples if needed.Except the fact that having a 5 days BootCamp was quite challenging for some participants, and some subjects were not largely developed due to time constraint, I can say that myself and the majority of the attendees have acquired enough knowledge in data science and machine learning to start dealing efficaciously with their day to day corporate problems.",100,100,100
Yong Jian Chong,N/A,11/30/2019,"It was an incredibly enriching experience - coming from someone who has struggled to understand concepts and put things to practice from online courses. The balance between theory and practice was great, and it allow us to develop a foundation in data science. What made this possible was the agility & knowledge of the instructors (yes, the founder teaches!), and you will be placed in a safe environment to learn and improve. If you're looking at upgrading yourself, then I highly recommend this course!",100,100,100
Zainab Al Lawati,N/A,11/29/2019,"It was the best training I've ever attended. The 5 days were packed with knowledge, great discussion, and hands-on experience.The learnings were also solidified by the additional exercises provided for practice.Despite the intensity, it was easy to follow and never boring :D The bootcamp also offers additional webinars and video tutorials covering more topics.The instructors: Raja and Rebecca did amazing job in teaching and making sure we understand the core concepts regardless of the tools used. They encouraged and emphasised on the importance of having a decent thinking process. This was the most valuable outcome for me, as it is easy to find tutorials online on the ""how-tos"", but understanding the full picture and the reasonings is challenging. Overall, attending this bootcamp was one of the greatest decisions I made.",100,100,100
Jerry Chee,N/A,11/29/2019,"The bootcamp was led by the CEO himself, who attacks the topic from very practical angles. In comparison with Coursera, DataCamp or other in-person experiences I’ve had so far on programming/data science, I definitely felt like Raja managed to, not just select, but engage us in the most important concepts on both Data Science and Data Engineering (trust me, I’ve spent hours googling about these topics and speaking to a variety of ‘techies’, but just couldn’t get that ‘full picture’ view of how it all comes together). The course is very well structured, laying out ground concepts first for every topic and then useful practicum. The instructors are adamant that the thinking process is the most crucial element and I fully agree. Healthy debate is encouraged and the instructors are able to tie it all together by sharing practical experiences they've had. If you've ever been taught by practising adjunct professors in school, versus those who have been in academia all their life, you'll know the difference - especially when you only have 5 days to spare to immerse yourself in a new topic.I would strongly recommend going for DSDojo's bootcamp if you can spare the full 5 days if you're-1)An analyst already working with data analysis and want to do more2)Manager who wants to cut thru' the buzzwords and know what's relevant for your business today3)Data science enthusiast who's implemented some models before but know that you need your work to be more polishedI would also recommend that you don't go in 'cold'. You should finish the pre-course videos and exercises in order to extract the most of the 5-days. Don't be lazy!",100,100,100
Hajar,N/A,11/29/2019,"The whole experience has been very enlightening.The concepts were explained thoroughly without the hardcore jargon.Data Science Dojo sets a strong foundation for each of its attendees ,paving way for me to explore my own path in data science further.Thank you Rebecca and Raja for the awesome opportunity and I hope to be relaying good news in the months to come!!",100,100,100
Kelvin Lui,N/A,11/29/2019,"The best part of this intensive course is the deepdive and discussion like approach to the fundamentals of Machine Learning. Accuracy of models dominates the purpose of any model, in a general sense, however this course allows for the greater picture to come into view. As they say, with great power comes great responsibility and Data Science holds great power.",100,100,100
Steve Sabatini,N/A,11/27/2019,"This course is exactly what I needed. It has given me an understanding of the tools and concepts by putting it all into the perspective that I need to be able to drive initiatives in my company. The instructors are excellent and the course is immersive with a great balance of instruction and hands-on work to put into practice what you're learning throughout the week. I have been pleasantly surprised (amazed, really!) at how accessible the entire world of data science and machine learning have become as a result of the bootcamp.I also want to specifically highlight a few stand-out things for others who may be interested:1. The instructors, especially Raja, were able to take their excellent data science expertise and put it into business context, always keeping focus on the big picture of business goals and desired outcomes and not just the technology tools themselves. 2. The team has an excellent ability to convey complex topics with real-world, practical examples and it was a huge help in understanding the many important points covered during the bootcamp.3. I've been extremely impressed with the enormous amount of high quality supporting materials (slides, handouts, videos, code examples, documents, interactive learning tools, etc) that are part of the bootcamp curriculum. All of the content and exercises are laid out in a comprehensive way that helps with advance preparation before bootcamp right through the bootcamp itself and all of those materials are available after the bootcamp for review and additional learning. If you're looking for an immersive one-week, end-to-end introduction to data science, you will thank yourself for choosing the Data Science Dojo bootcamp.",100,100,100
Anurag Rastogi,N/A,11/26/2019,This class is must for the people starting in Data science/ analytics. Instructions were really clear and instructors were really awesome and has in depth knowledge. I am glad I attended the boot camp and it completely change how I see the Data now .,94,100,80
Fred Lissau,N/A,11/26/2019,"I attended the Washington DC November 2019 class with Raja, Arham, and Tarun. Raja as a lead instructor was a treat. In an exhausting but comprehensive way, we finely examined fundamental components of data science and collectively gained the foundations upon which to build quality models and analytics. It's difficult to articulate all of the little qualities and attention to detail that the Data Science Dojo team has invested into building an informative course and a valuable experience.",100,100,100
Maria Sylte,N/A,11/26/2019,"This was a highly productive way to spend a week. Using easy-to-grasp examples, we walked through numerous data science concepts that are highly applicable to real-world challenges. Raja designed the class to be both easily accessible to those with minimal data science/programming backgrounds, but also in a way that exposed actionable, comprehensive techniques to solve real-world problems. This class will fundamentally change how I solve business problems in the future.",100,100,100
Edward Love,N/A,11/25/2019,"This boot camp focuses on fundamentals and high-level concepts rather than teaching you how to program. Though there is instruction in R, that is not the focus. As someone who has been using R in my day-to-day work for the past few years, this was a plus for me. What I needed was a place that would fill in my gaps in foundational knowledge that I knew I had and to point me in the right direction for further study--this boot camp did exactly that. A lot of knowledge is packed into 5 days, and I have left feeling more confident that I can create (better) predictive models and efficiently deploy them to production. There are tons of resources out there for learning to program, but the foundational knowledge is what seems to be lacking in short programs. I'll recommend this to my friends and co-workers. Just be aware that this is not a program that is set up to help you find a job. Most people in the program already have full time jobs, and this boot camp is oriented toward making them more effective at their existing jobs.",86,100,60
Jonathan Shaanan,N/A,11/23/2019,"As a venture capitalist, almost all startups are pitching that they do AI and machine learning, but much of it is window dressing. After this bootcamp, I finally have the tools to have an intelligent conversation with any data scientist and decipher which startups are actually deploying the technique effectively. I wish I would have done the bootcamp several years ago, and I believe it should be required training for all venture capitalist who do not have a background in AI.PROS:+ Great balance between in-class lectures and labs where we apply the techniques+ No need to have a technical background to get a lot of this course+ Raja is a great teacher who is passionate about the subjectCONS:+ None",100,100,100
Anonymous,N/A,11/22/2019,"Great class. I did not have much hands on experience at all coming into this bootcamp, so was a little apprehensive. But the course was very well paced and taught. Raja is an excellent teacher. I understand the concepts now that earlier seemed foreign and I understand them fairly well. I would highly recommend this to folks at all levels. And I would choose this bootcamp again if I had to do this over again. Good luck, DSDojo!",100,100,100
Rockel,N/A,11/22/2019,"This training was an all around great experience. Data Science, Data Engineering and Machine Learning are complex but this training was presented in a way that anyone with basic business or technical skills can easily understand and apply the concepts. The intent is not to send you home with a plug and play model - instead you leave will all the skills needed to build models that are applicable to any business problem that requires machine learning. The true magic of machine learning is understanding how to clean, transform and use data to build robust effective models - which is covered in depth in this course. You also leave the training with a comprehensive understanding of open source tools you can use immediately in any industry. The instructors are all extremely knowledgeable and went above and beyond to help students grasp the concepts and provide guidance on how to apply them to business needs.",100,100,100
Ash Singh,N/A,11/22/2019,"After looking for at multiple data science / AI / ML programs (both online and in person) I landed on the 5 day bootcamp as that was the only one which allowed a structured approach to learn foundations of the subject in a compressed format. Initially I was skeptical of the value of trying to learn such a complex topic in just 5 days. Let me tell you though - the DSD team did a phenomenal job deconstructing the topic, setting the stage to build a strong foundation and provided tools for continued development post the 5 day bootcamp. Obviously a lot of what you get out of any program is what you put into it but I find it extremely important to have a good structure. Initially I was a bit apprehensive as I lack technical / programming background. I was pleasantly surprised by the business led approach to problem solving that really a lot of people miss. Raja is an amazing educator, super passionate about the topic and has immense industry experience that is on full display throughout the training. He really broke down fairly complex concepts by putting things in context. Even though it was a fully large class Raja was able to keep everyone engaged throughout the 5 days of bootcamp and beyond! As I leave the program, I feel fully empowered with tools and foundations to continue learning as well as applying data sciences to my work. It has given me a new perspective and ideas that I can't wait to apply to real life scenarios. I cannot recommend this program enough to anyone who is curious and willing to spend some time learning data science / analysis concepts. As they say - Data is the New Oil!",94,100,80
SM,N/A,11/22/2019,Rapid learning and significant value creation towards career development. Focuses more on fundamentals than just the tools which lays a strong foundation for learning & development further in this domain. Course selection is well balanced between theory and practical exercise.,100,100,100
Piethein Strengholt,N/A,11/21/2019,This is seriously one of the best bootcamps I have ever attended! Instead of learning how to use tools Raja gives you a very solid and broad understanding of how data science works and what the underlying concepts are about. Highly recommended to anyone. ,100,100,100
Jeff Smith,N/A,11/13/2019,One of the best trainings I have ever attended. Raja and his team packaged all of the content that you need to have a well balanced understanding of data science concepts into a well orchestrated week long session that you can go back to work and instantly start applying. Highly recommended to anyone looking to gain these skills!,100,100,100
Milton Sato,N/A,11/3/2019,"Raja and team provide insightful tips in each step of the way. Basically cutting down reading time that I would have to do browsing and learning what to look for. The hands-on structure of the course helped me to get ""practice"" knowledge instead of just ""head"" knowledge of concepts. Through the concepts, I could identify a couple of low hanging fruits in data analysis and visualization, in which I could apply the concepts to my work context.It was very cool to be able to run a machine learning model from scratch and understand parameters involved in optimizing it in a real world context. Attending the program made me comfortable to discuss data science applications during job interviews.Finally, I think the experience at the camp gave me a foundation to jump start my learning from there.",94,100,80
Franco Parico,N/A,10/30/2019,"I took this course with very little prior knowledge in data science, with the intention of having enough background knowledge to be a better product manager in the data science/machine learning space. After this bootcamp, I have achieved that and more. Raja and the team have put together a comprehensive curriculum that gives students a solid foundation in the fundamentals of data science, with less of an emphasis on getting answers and more on framing problems to understand what the right questions to ask are. It was a very good balance of instruction, hands-on exercises, and open discussions to keep everyone engaged and learning.",94,100,80
Matt St. Louis,N/A,10/28/2019,"This is an extremely intensive week, but if you are interested in starting, or continuing, your data science journey, I can't recommend this highly enough. Raja and his team do an amazing job of covering a broad range of topics, while insuring that your understanding is there before moving on. They also leave you with numerous tools, examples and other references to continue after the week is over. I got far more out of this one week than I have in over a half dozen on-line courses.",100,100,100
Sri Malladi,N/A,10/25/2019,"This is hands-down the best training class (business or technical) I've attended in the last several years, and an excellent value for money and the time spent. Raja and his team do an amazing job of making everything seem accessible and they demystify all the concepts. Every concept is both theory and practice - from decision trees, to recommendation engines, to Hadoop clusters to IoT. Coming in, I had zero hands-on experience with the topics, but by Day 5, I feel comfortable enough to hold an educated conversation with data scientists, and I'm confident that if I invest more time learning (using their website and online tutorials), I can build something that works. I also feel confident enough to map the business problems I see in the financial services industry to appropriate techniques.Raja leads most of the conceptual teaching, and his team do most of the hands-on tool-based exercises. Raja is able to explain complex concepts very simply, and takes the understanding to an intuitive level - there is no hiding behind sophisticated terms. The Kaggle competition was fun, and I learnt a lot by spending time tweaking my models. The class requires a ""programmer"" mindset to get the most of it. It is an intense week, and I had to spend about 1-2 hours every day after the classes, doing the HW - sometimes I found this challenging. There were some concepts which were not covered due to lack of time - but like the 80-20 rule, I feel that everything essential was covered very deeply and thoughtfully. The location was at times a little noisy (another class next door) and warmer than I liked but nothing too bad.Overall - strongly recommend!",100,100,100
Tomas Perez,N/A,10/25/2019,"I came to this program with an engineering background, and this program opened my eyes to the power of data science tools to understand my data and more effectively solve problems at work. The instructors are knowledgeable, experienced, and easy to understand. I'm impressed at how much information I was able to comprehend in such a short time.",100,100,100
Vijay Bulusu,N/A,10/24/2019,"The program is well balanced to cater to students with prior background in data science as well as to students who have no prior experience. All the instructors are great but in particular Raja is amazing. He puts in a lot of effort to ensure his entire class understands the concepts and knows how to apply the concepts (he calls it reasoning). The location in NY could have been better but for the most part everything went smoothly. The course materials, online tools and software packages were all excellent. I look forward to being part of the alumni of Datascience Dojo.",100,100,100
Tudor Oprea,N/A,10/15/2019,"Pros:as a data science professional, but mostly on the ""user"" side, I found this course to be refreshing, entertaining and VERY informative.The instructors were very helpful, engaging, and in constant dialogue with the audience.And everyone had a chance to run hands-on machine learning simulations on several datasets. Cons:I could not think of any. However, be prepared to spend an entire week on this course - it is fully immersive. But you won't mind, because your brain will start spinning.",100,100,100
Laura Alejandra Garces,N/A,10/15/2019,"The boot camp is an excellent beginning for people who want to start working in data science, and for those who already have a journey on the subject, it serves to remember the fundamentals and logic behind the implementations and get tips and suggestions based on better practices and experience of the instructor. The boot camp includes many material to prepare you for the coming classes and excellent examples implemented in different tools for future projects. All shared through an online platform easy to access and use.",100,100,100
CARLOS HIROSHI NITTA,N/A,10/13/2019,"Five days and 50 hours is a lot and definitely not enough. I describe like this, imagine yourself taking the red pill and then you start learning all about what is really happening with and around you in terms of data science technology. I am very grateful for the opportunity to meet Raja, Arham, Tarun and Rebecca. Thank you all for sharing a little bit of your knowledge and experience with me. The amount of accumulated knowledge you guys have is really impressive. I definitely recommend anyone with or without data science background and experience to sit and listen what the Data Science Dojo Sensei have to say. It was a great honor and terrific experience to be part of the first Data Science and Data Engineering joined with The University of New Mexico Continuing Education.",100,100,100
Emily Paynter,N/A,10/11/2019,"As a beginner, I was concerned that I would be lost. Data Science Dojo made all of the information understandable for every experience level. The instructors were all engaging and knowledgeable. They also provide a lot of learning material for after the course is over. I feel like I will be able to apply the skills I learned at the Data Science Dojo Bootcamp in my career. Overall, it was a really great experience.",100,100,100
Andrew Donate,N/A,10/11/2019,"As I searched through the several companies that offered in-person data science training, I decided to take the trip out to Albuquerque and attend this bootcamp. Having enrolled in an online degree seeking program for data science, I still didn't wasn't convinced this was the way to acquire the theory and practical skills needed to explore more beyond as an everyday data analyst. After weighing all of my options and attending this bootcamp, I have to say, I definitely made the right decision. Attending here I definitely received what I was looking for. I learned the ""theory"" and was able to put this theory into practice by in-class labs and outside homework. Even though this past week was very demanding (felt like I was back in college), I believe the time and money invested has and will pay great dividends in my future. Data Science Dojo is definitely worth your time if you are seeking to learn more about data science and machine learning. The curriculum is very well designed covering every aspect of the data science field, and the training is very relevant and the instructors are very knowledgeable. Upon my return to work, I plan on apply the new skills learned this week; I would definitely recommend anyone who wants to serious learn more about data science, this is the training you need to attend.",100,100,100
David Anderson,N/A,10/10/2019,"The staff from the Data Science Dojo presented an excellent course from the Bootcamp preparatory materials, lectures and exercises to the 5-Day Bootcamp itself. The instructors presented the topics in a clear, concise and understandable manner in their online courses and did an even better job in person. Rather than just teaching the students how to use libraries from various programs to perform machine learning tasks, they take the time to teach the theory behind who the libraries and other tools work. While I use statistical analysis techniques on a daily basis as part of my job as a risk manager, the tools I have learned in this course will enable me to provide better information to my clients and deploy predictive models to help them achieve their objectives. The instructors are approachable, personable and experts who can explain complex concepts in an easily understandable manner. I am glad I took this course and am working with my company to get my colleagues into the class as well. The cooperative efforts between the University of New Mexico's Continuing Education department and Data Science Dojo provided the additional benefit of making this a university accredited certificate at this location. The level of instruction, the depth of the course materials and the in-class exercises throughout the course make this a great way to learn data science. Their online learning platform and instructor assistance, both of which remain available to alumni after the course, make this a learning experience well worth the time and money.",94,100,80
Robert Taylor,N/A,10/10/2019,"I attended the Data Science Dojo data science bootcamp in Albuquerque, NM in October 2019. This was an outstanding learning experience. I had previously completed an IBM data science certificate on coursera. However, I still struggled with concepts and applications. I signed up for the bootcamp with the hope of enhancing my skills and ability to apply the knowledge. I was not disappointed! The instructors were extremely knowledgeable, patient, engaging, and lively. I truly can not believe the breadth of knowledge and skills I have gained in just 1 week. I now know that, not only am I well versed in various algorithms, techniques, and metrics, but more importantly I KNOW HOW TO APPLY IT in the real world. How do I know this? Well, with what I've learned I'm already exploring, analyzing, and predicting with my own real world data from my laboratory. Things that I use to struggle through are now easily implemented and explored. If you're thinking of signing up for a DS bootcamp, Data Science Dojo is the way to go! Great experience!Ohdid I mentioned that this was an ACCREDITED certificate program in conjunction with the University of New Mexico!? This is the first ACCREDITED DS certificate from a bootcamp that I'm aware of. You shouldn't hesitate to sign up for a Data Science Dojo bootcamp. It will be an extremely rewarding experience!",94,100,80
Divyesh Sharma,N/A,9/21/2019,"I normally dont give out reviews for restaurants, places of visit or movies etc but i could not skip this one! Data Science Dojo 5 day bootcamp has everything for you to fall in love with data. The 5 day course is very well structured and articulated so brilliantly by Raja and his team! There is no pre-requisite to be in this bootcamp as far as programming or data science knowledge goes. The 5 day bootcamp goes through some real life data discussion and how to harness the power within data to make business decision while learning tools such as Azure, R and big data tools. It is an intense bootcamp from 8 to 6 followed by homework. The bootcamp will instill a sense of challenge in you and by the time its done you will be addicted to all concepts of data science be it data exploration and feature engineering, modelling and optimizing your model for making prediction on business outcome, big data and IoT and all the tools that go along to achieve the outcome. The team is very knowledgeable and the course content and Raja is very engaging. In my 50 hours of bootcamp i never felt loosing focus for a bit. If cruising altitude is the limit for datascience the course has already launched me to get to the cruising altitude and have become 50 hours wiser in data science. Thank you Raja and Team. I would recommend this course to all my peers, friends and family. Keep up the great work and educating people!",100,100,NA
David Blauvelt,N/A,9/20/2019,"The Data Science Dojo lives up to its slogan, Data Science is for everyone. The 5 day course is 10 hours per day (with a ~5 hour pre-course materials and homework throughout the 5 days) and lets you get hands on with Data Science tools. For non-coders, you can use Azure ML and for coders while the course examples are taught in R, there are equivalent code sets for Python. While the days are long, they provide you with enough tools and information to go back to your office and start working on your tools and learn where to find questions down the road. The course content is very well organized (you bring your laptop but you don't have to install anything; you can run code through web interfaces). In the past, while I had dabbled with R and Python, I had never been able to harness the end-to-end applications of data science tools showcased in the course (including, deploying and setting up web services). I would recommend that managers should also attend the course as there is a focus on the implications of the outputs of the tools, including precision, accuracy and recall.",100,100,NA
Jon Neill,N/A,9/20/2019,Overall this was a great intro - well balanced between theory and practice. Raja's ability to explain these complex topics in a easy to understand way is fantastic and the overall team is very knowledgeable. This bootcamp is probably better suited for someone who wants to be a data science practitioner - but I also found it valuable as a manager who wants to understand the overall data science landscape. Overall .. highly recommended.,100,100,100
Wasif Huda,N/A,9/19/2019,"I have been working with in data for the last 10 years, working with transforming and reporting data in analytics. I decided to take this course to improve and look at data from a different perspective. I absolutely loved the course, the topics were carefully chosen, each topic had plenty of examples and building examples in R and Azure ML. I did a lot of research for data science course, and this by far has the best crafted curriculum. This is the only on-site data science training where the teachers and TA are on-site to help and encourage open questions. They come early and stay late and very responsive to feedback and really do care that everyone in the room understands. I feel confident after taking this course to go back and provide immediate business value for my company",100,100,100
Anonymous,N/A,9/19/2019,"Very in-depth, easy to understand as long as the pre-course work is completed. Fun competitions, lots of help with extra instructors in the room, personalized support before & after class. Definitely feel confident walking out each day.",100,100,100
Alissa Brayley,N/A,8/27/2019,I recently graduated from school and am working as a Retail Scientist. I attended this bootcamp to help me transition from the research environment to corporate where I'm working with very large data sets. I came into the camp with an understanding of ML algorithms but no practical experience. This course helped me gain confidence implementing ML models. I now feel as though I understand how select features and tune model parameters. The instructors were amazing and provided amazing resources.,100,100,NA
Dalia Sherif,N/A,8/24/2019,"The main benefit of attending this camp was getting a clearer idea about data science and the role of data scientist in general. It was an eye opener and helpful in so many ways. It emphasized the value of the practical experience in any domain, and how to effectively utilize data science techniques to achieve the desired business goals. The curriculum was well designed and the instructors, Raja and Rebecca, did their best to explain key concepts and keep up the class at the same level as much as possible. The Kaggle competition was a smart way to keep the class engaged and make sure that we apply the concepts learnt and get the most out of this experience. The discussions were always rich and I really benefited from the smart professionals in the class! Thank you for amazing experience..",100,100,NA
sachin sharma,N/A,8/23/2019,"The curriculum has been designed in a way that everyone stands to benefit from joining this course whether you are a developer, manager or an existing data scientist. Raja's explanations are very detailed and helps you understand the complexities around various algorithms very well which is something you may not be able to understand outside. Kaggle competition makes sure that you stay engaged for a good part of 5 days and improve your skills side by side.",100,100,NA
Erika Rasmussen,N/A,8/23/2019,"I started the course as the person with the least data science & programming knowledge, by far. Despite my worries, I found that I was able to stay engaged all week. By learning from the instructors and interacting with the other students, I was able to really comprehend the methodologies in a way that would never have been possible while learning independently at home. I am so excited to go home and continue my studies and my relationships with my fellow alumnae. My gratitude to Raja and Rebecca stretches to infinity.",100,100,100
Anonymous,N/A,7/16/2019,"What impressed me most is the instructor Raja. He is brilliant! He makes a data science course more interactive than I can image. We were completely comfortable to ask any questions at any point of time. No matter how ""dump"" or ambiguous the questions were, Raja could tackle it well, considering both technical and business perspectives. I love the way he articulated technical concepts - simple & intuitive languages makes it effective. This inspired me. As a person, Raja is full of passion, patience and sincerity. In addition, I enjoyed the ""open discussion"" atmosphere in the class. In the 5 days spent together, I got to know smart & decent classmates. All are nice memories!",100,100,100
Courtney Regan,N/A,7/16/2019,"This boot camp was one of the best training opportunities I have ever had. what would have taken me months to get across in online tutorials or other modes was demonstrated in one week. I have confidence in adopting ML methods in my work and renewed understanding of data analysis concepts. In fact I wish I had had the chance to attend before beginning graduate school and would recommend anyone about to undertake and advanced degree or PhD that relies on data analysis to consider attending. You wont regret it.Raja and Rebecca have deep knowledge in data science, machine learning and statistics and more importantly they impart that knowledge so effectively and easily to students. They go above and beyond to ensure foundational concepts are well understood before trying to implement them, which means as a student you not only know how to implement the tools, but why you are using them. I can't recommend the boot camp highly enough. make no mistake, it is intense, the hours long and some of the concepts challenging , but the rewards for your efforts more than outweigh the work you will put in.",100,100,NA
Bronson,Supply Chain Planner,7/13/2019,"The 5-day bootcamp held in Singapore was my first foray into data science or anything programming-related in fact. There was a diverse group of attendees from different industries, professions and countries. Although they commit to cover a lot for a 5-day bootcamp, the team at Data Science Dojo have delivered on their commitments. Raja was skilled in drilling down to the concepts behind the hottest topic today, engaging constantly to check for understanding. The exercises were well-designed to enforce learning and the competition element certainly pushed everyone to improve their skills day-to-day. There were no attempts to fluff things up. On the contrary, Raja together with Rebecca have demystified data science for me and made it much more accessible.",100,100,NA
Mohammed Al Abdulal,Business Analytics,7/12/2019,"we spend s days bootcamp in singapore and end up with huge knowledge in statistical, prediction, & create models, and we got basic knowledge in R language and how to build a model even if you have zero knowledge in programming structure. Raja and Ranicca are amazing presenters. I recommend people to register for data science",100,100,100
HT,"VP, Retail Analytics",7/12/2019,"I am working in the field of AI/Big Data but I work on the business side. My goal is not to perform data science analytics myself but to make sure I understand what my colleagues are doing, what is technically possible, and how it impacts business strategy and outcomes. Other data science bootcamps focus mainly on using the tools. e.g., how to use R. Which is great if you are intending to become a data scientist.Data Science Dojo has ~50% focused on the actual coding, but a large proportion on going in depth around the principles behind the code, which allows you to understand the mechanics and pitfalls. The instructor, Raja, makes the course very interactive to force the class to reason as you go through the course.",100,100,100
Rosa Antonini,Data Architect and BI product Engineer,7/8/2019,"I just attended Data Science bootcamp with Data Science Dojo and I am thrilled to say that this course matched my high expectations. I did a lot of research on different data science and machine learning courses and decided to take this one based on its great reputation.The bootcamp was as an amazing, fast paced and challenging experience for me. The teachers are very smart and knowledgeable with hands-on experience using the latest technologies. The amount of material was very extensive, the depth and breadth on each topic of data science and machine learning was superb. There was serious work to be done which included both pre and post work to the bootcamp. I am very grateful to the Data Science Dojo team for their passion and commitment to provide such a wonderful product and high quality of work.",100,100,100
John Spencer,Sr. Research Specialist,7/6/2019,"I attended the New York City bootcamp in June 2019. First off the schedule is intense, 8 am- 6pm for five days. They tell you on the first day that 6pm doesn't mean 5:30 or 5:45, it means 6:00 and sure enough we went to 6:00 every day. Those long days mean they can cover a lot of material. I was worried that depth would be sacrificed in exchange for it being a one week course, however that was not the case. I found the materials to have an appropriate amount of depth to understand the concepts. That said, any data science course (whether 5 days or 5 months) can only take someone so far the rest is up to them to practice and develop the skills once the course ends. For those who are motivated to continue practicing after the bootcamp is over these 5 days will get you off to a good start. The instructors are very good and do a good job presenting material that can be challenging to grasp. There are hands-on exercises in addition to the lectures so there's a chance to use R or Azure to apply what you've been learning about. Bottom line is in 5 days I learned what other courses and bootcamps cover in 8-12 weeks. I highly recommend it for anyone who is interested in building data science skills.",100,100,NA
Sienna,Operational Research Analyst,7/1/2019,"My husband bought this class for me as our anniversary gift, at the beginning I laughed about it, but in the end, I sincerely believe this was a great gift! We both had a good time in class, and we loved it!Here is a breakdown of my review:Content 4/5 Difficulty Level 3/5Classroom Experience 5/5Value 5/5Location 5/5Bottom line: I would highly recommend this class to my friends and my coworkers#Content 4/5#The boot camp mapped out the entire Machine Learning Life cycle. You can immediately apply the content to your work/project and start using the tools and models. Of course, you still need to learn more techniques that are more specific to your business/problem, but the boot camp at least showed you where to start and what may be relevant for you to learn to solve the specific problem. We did not have time to cover regression, IoT, Block Chain in class, but have learning materials regarding the topics that you can learn by yourself after the boot camp.#Difficulty Level 3/5#This is not an advanced statistic class or coding class, and most of the content is not new to people who had statistic class before. Even for people who are new to the topic, Raja made it reasonably easy to understand. I talked to some experienced students (including PhDs in the statistic field), and they also agreed that even though the content was not new to them, but the way it was structured and being able to relate to the real world business cases made this class unique and valuble.#Classroom Experience 5/5#Not only Raja is a great instructor, but also his team provided excellent support whenever we had questions. Another critical factor is the diversity of the students. There were Executives, engineers, developers, analysts, college students, professors, entrepreneurs, social media influencers and they were from all kinds of industries like IT, health care, consulting, manufacturer, energy and all over the globe, Europe, South America, AsianEveryone brought fresh perspectives to the class, and just the networking opportunity itself was well worth the investment.#Value 5/5# I once saw a beginner level two-day PowerBI dashboard training for $3,500, if people pay for that, this boot camp is ten times more worth the money. I went to another data science boot camp in Toronto from a university last year, and it covered almost the same content and cost the same. But that I had no clue how to use what I learned in that class in my work, while after this boot camp I know how to build a useful model, a basic one, but at least I know where to start. The class itself is well worth the money, plus the networking opportunity, the supplement learning materials, and useful souvenirs (each class may be different, but ours was a heavy-duty backpack, notebook, coffee mug, and bunch of other stuff). I would have taken the course even the price tag was higher.#Location 5/5#We took the 2019 New York City class, and it was located on the 59th street, between Park Ave and Lexington Ave. There were many options for hotels and Air BnB. There was a subway station right outside the building, but we didn't use it since it was 5 mins talk to where we stayed.#Other thoughts#The content of the class was not new to me, and I learned the most of the statistics concepts in undergraduate, graduate classes, and another data science boot camp. But this was the first time I felt I could actually use those models, codes, and tools to build something real and use them at work. The instructor, Raja, was putting more emphasis on letting us understand how the theory/model works for businesses and why it works/not works. He did not make us memories the code but pushed us very hard to truly understand the statistical as well as the business logic behind the code/model.The boot camp itself was five days (50 hours), but there are pre-boot camp learning sessions(which were very helpful) and after boot camp learning materials. Plus they also gave us several physical books along with the PDF versions. So leave extra luggage room for the books and gifts if you need to travel to another city to attend the class.",100,100,100
Tessa,VP UX & Tech,6/28/2019,"The instructors do an awesome job making the foundational principles accessible and training you in tools that challenge and allow you to explore your data skills - whether your a novice or advanced. I would recommend this bootcamp for anyone working with data in any capacity. Your understanding of the process and what is actually happening in different learning methods, algorithms and libraries will make you a better at identifying and solving the real-world business problems.",100,100,100
Patty,"Director, Integrated Marketing Strategy",6/27/2019,"I am a marketing professional with zero background in coding/development or data science. Our marketing agency wanted to better-understand Machine Learning to educate our clients and prospects about practical business and marketing applications, so a few of us attended this course. The instructors were incredible and able to break down complex concepts into easy learning material. The days were intense (8 hours + homework), but the learning was invaluable. I was able to learn R and Azure from zero, and now can build predictive models, recommendation models, and machine learning models! This course will not only give you the skills to build and apply models, but ensures you know PRACTICAL application and how to think through model application from a strategic business perspective.",100,100,NA
Agnieszka,N/A,6/27/2019,"Very intersting areas of data science and machine learning are introduced. The instructor takes care about every detail, explaines everything on real examples. It is not only a theory, big part of the course is the practical implementation. All needed basics for understanding the machine learning in data analyses are presented. Definitely, it is worth to attend this course!",100,100,NA
Tomek,Software Developer,6/27/2019,"This data science course is very professionally prepared. There are a lot of additional materials and examples of solving real problems. Everything is well explained. The best choice for people who are, or who be a data scientists!",100,100,NA
Shong (Sean) Chong,N/A,5/8/2019,"Instructor -great explainer: intuitive, highly relatable real-life example are used to walk through difficult concepts and to engage class in discussions. This is typically done PRIOR to reviewing the course material which makes it much easier to understand and grasp the concept-the instructor is deep in a highly technical space yet clearly passionate about teaching, it showsBootcamp-Covers all core key concepts in data science-Walk through examples in depth to illustrate the family of algorithm (teaches how to fish)-Start with theory then immediately pivot to real life applications and practices. -e.g. the emphasis on starting business problem first before translating it into a DS problem. -Kaggle competition keeps it fresh and excitingIf you decide to attend – do the prep work and commit the full week to get the most out of it. As stated in the other review -- “be prepared, the program is definitely intensive, and you get the most if you're prepared and willing to dive in and invest time in practicing these skills.”",100,100,NA
Minnie Cui,Data Analytics Manager,4/25/2019,"The 5-day bootcamp is done extremely well. All the key topics of data science covered in a digestible way, with a strong emphasis on business application. The level of participants is also top notch - data science and related business professionals from a breadth of companies/industries, which boosts the quality of our discussions and the overall experience.From my own experience, I think you'd get the most value of this program if you're at least conversant in basic statistics or have a basic level of exposure to data science concepts. In my opinion, it is ABSOLUTELY worth the investment of ~$3k and a week's worth of time, which is more feasible for company training vs. an extended 10-12 week part time program (which I've also done before).Be prepared though, the program is definitely intensive, and you get the most if you're prepared and willing to dive in and invest time in practicing these skills.",100,100,NA
Anonymous,BI Developer,4/16/2019,I don't usually leave reviews but this was by far the best training I have had outside academia. Raja is extremely talented at explaining machine learning algorithms in a way that makes them intuitive and allows one to understand how to tune the modules which employ them (while also emphasizing the need to look carefully at the big picture before getting into the weeds); certainly one of the most talented teachers I've ever seen.,100,100,NA
Chirranjeevi,Research Scientist,4/13/2019,"I really enjoyed attending the bootcamp, particularly for its focus on business value of data and a foundational understanding of various ML algorithms. Raja is an excellent teacher, and kept the class engaged for the entirety of the five 10 hr days despite the intensity and volume of material being covered. Despite having had fairly good exposure to statistics and ML, it was instructive to have Raja explain the key mathematical constructs behind classification and regression models, model evaluation metrics, over-fitting and bias-variance trade-off -- all in the context of real business situations. The course struck a good balance between breadth and depth of content, and provides a good platform for anyone to start incorporating data-science into their business.",100,100,100
"Kalin Yanev, PhD",Research Director,3/5/2019,"Data Science Dojo is an amazing boot camp - it worked perfectly well for me although I have certain background in statistical analyses and ml itself. I believe that same would apply to various types of experts (as it was the case within my class itself) since the camp provides a very ambitious overview of almost anything that may concern a data scientist - from the field itself to the important big data engineering.On top of this, the vast content is easy to follow thanks to the excellent teaching style of Raja. He starts from the basic logic and it helps enormously. I have been recruiting data scientists myself and have seen that this is a key point to differentiate someone who can do the job or is not more than just technically aware. (From now on if I meet someone who has passed the dsd boot camp, I would know with 0.05 probability of type I error that I have an expert in front of me.)Raja is able to comprehend and react to any eccentric question which I believe is so very important for any teacher.At the final day of the camp (no matter that exhausted from my late night attempts to cheat on the Kaggle competition) I was realizing that I do not want the camp to finish (and it had nothing to do with going back to work or the good food).Highly recommended.",100,100,100
Joe Hootman,N/A,2/12/2019,"The DSD team pulled off a successful balance of two things: packing as much content as possible into the bootcamp and making sure you understood and absorbed it (not just firehosing it). I remember one particular session where the instructor deliberately and patiently asked and answered questions over and over again to make sure that no one left the room with a key concept zooming over our heads. I left with a lot of fresh questions to explore on my own, and you have to own your own journey, but got led a good way down the path by the team. I particularly appreciate DSD's values in hosting an after-hours broader conversation about using AI for social good and for accommodating the challenges of my disability.",100,100,NA
Rick Barnes,Systems Engineer,2/1/2019,"This course brings clarity to the theory and practice of data science. What was once in my periphery is now much more in focus. There is much attention to detail. The venue choice, course material, and instructors were top notch. There is still much to learn, but I cannot imagine packing more into 1 week.",100,100,100
Hamad,Vice GM,11/30/2018,Its amazingly comprehensive and detailed at the same time. Must do if you want to start your career in data science,100,100,100
Nancy forero,PM,6/29/2018,"This is not for the faint of heart. It goes deep into algorithms, data, metrics, comparisons, etc. One of the most valuable things I learned was to ask smart questions and to think in a data analysis machine learning kind of way. Raja was fantastic. And the support team patient and helpful. It’s very intensive 5 days with over 10 hours a day of work. You will definitely get your money’s worth if you do the assignments. You will also learn even if you don’t do them but not as much. It’d be great if the summaries they send at the end of the day were a little bit more detailed. I recommend this wholeheartedly. And I will look for other classes they teach to maybe go deeper into some topics if I want to. Food at the Marriott was SO good, too.",100,100,100
Alhanouf Alsuwailem,Data Analyst,4/15/2018,"It was a great week, I really recommend it to anyone who is looking for a short term course with a full coverage experience in Data Science and Data Engineering. Also, Mr.Raja the BEST instructor I had ever seen.Thank you all.",100,100,100
Hussain Shabbir,N/A,4/14/2018,The focus of the bootcamp is on the application side of data science. It is very easy for a new data scientist to create models irrelevant to its business case. This bootcamp emphasizes the use of machine learning tools with the business case it is meant to serve. The instructor Raja Iqbal is obviously very passionate about teaching,94,100,80
Anonymous,Business Intelligent expert,4/12/2018,"A great boot camp , Mr Raja was the best instructor I had ever seen Respect",100,100,100
saud,N/A,4/12/2018,"I really liked the bootcamp, its has so much valuable inputs and ideas that lets you sharpen your data science skills.I highly recommend it.",100,100,100
Ismail Al Kamal,Director,4/12/2018,The course is perfect for busy business professionals wanting to get hands on experience on a broad set of data science topics. I definitely recommend it.,100,100,NA
Andres de Frutos,Business Development Manager,3/5/2018,"I attended Raja Iqbal's Data Science Dojo bootcamp in Washington DC last week. Even though 10 hours a fay can get very intense, Raja did a great job keeping the full group engaged and focused. The content of the course revolves a lot around the theory around data science and its applications. Before taking the course one has to complete serveral preliminary modules that teach some of the tools used during the class. You will learn the tools again during the bootcamp, but again, the focus is on the theory and application of what has to be done to extract useful algorithms from your data. Raja and his team were great. The food was very high quality and the bootcamp is a great opportunity to network with driven professionals.",74,100,20
Anonymous,Senior Analyst,1/17/2018,"I attended the 2018 Seattle Bootcamp. In summary, it was a decent data science training if you do not have much experience. Here are my thoughts on pros and cons.Pros:1. Good course materials and learning portal to guide learning2. Kaggle competition with others in the bootcamp to learn from each other3. Good networking opportunity4. General good basic knowledge in data science5. Ability and confidence to build simple models at the end of bootcampCons:1. Very focused on Microsoft products (bootcamp appears to be sponsored by Microsoft). Difficult to translate learning on Microsoft into practice if your workplace doesn't use Microsoft ML Studio.2. Questions in class are often deferred to later and not addressed.3. Sometimes discussions were long and not productive, which caused some topics to be not covered.4. Spent time on high school math, which was not engaging and caused delay in bootcamp progress.5. Data engineering topics were poorly explained.",60,80,40
Anonymous,N/A,1/12/2018,In depth review and hands on training required to be a data scientist is learnt in five days. Great content and helpful community.,100,100,NA
"Shyam Srikumar, CSCP",Sr. Principal Consultant,11/13/2017,"The 5 days transform you into a data scientist. Great curriculum as in the selection of topics, progression in topics, right mix of practice vs lecture, right mix of complexity vs practical use and finally the kaggle competition. Raja was a great trainer and very affable. I enjoyed learning the math behind decision trees, random forest, and linear regression. I am now confident to make the switch to Data Science!",100,100,NA
Anonymous,CRM Data Manager,11/12/2017,I don’t currently need these skills at my job but I’ve always wanted to learn more about predictive analysis. The opportunity came for me to take time off work and fly out to Chicago for this boot camp. It was BRUTAL because of a long day of class then homework at night but it was SO WORTH IT! I didn’t become an expert in a week but I learned a lot of tools and how they are being used in the Data Science field. I was able to build a ML predictive model in Azure with a R script embedded which was so cool because I knew neither tools before the boot camp.,86,100,60
LMuro,N/A,10/29/2017,"I attended the Data Science Bootcamp in Singapore (October 2017). I have to say that I thoroughly enjoyed the experience.The bootcamp was intense and fun. Raja, Bryan and Vicky were very knowledageable and they were always available to help us.The lessons had the perfect balance between theory and practice. The math behind most algorithms was explained in a very intuitive way and practical advice was given at all times.At the end of the bootcamp I felt that I learnt ""enough to be dangerous"". With the knowledge acquired I feel confident to continue my machine learning journey by my own (although Data Science Dojo will always be there to help).Thank you!",100,100,100
Matt L.,Lead Data Architect,10/17/2017,"The data science bootcamp was a great experience and is suitable for a broad range of backgrounds and skills. The curriculum is an ideal balance of theory and hands-on labs, while a lot of time is devoted to the challenging areas of data science involving judgement and intuition such as selecting and evaluating models. The techniques taught are technology neutral and the hands-on work incorporated a great mix of leading technologies. Raja, Artem, and the rest of the team are great instructors and were very eager to help and share their valuable experience. A lot of material was packed into 5 days and the pre-bootcamp assignments provide everyone with a common foundation. The code samples, instructional videos, and other material are extensive and serve as valuable reference information beyond the bootcamp.",100,100,100
Anonymous,Financial Economist,10/6/2017,After the bootcamp I feel so much more confident in data science analytics and I feel ready and excited to learn more going forward!,100,100,100
Jay Yamakawa,Crew Member,9/22/2017,"As someone who doesn't currently work in the tech industry, this course offers people a great introduction to the amazing tools that are available in the world of data today. I highly recommend it to anybody who is curious about data science, whether you currently work with data or are looking to break in to the world of data. Data Science Dojo will help you understand how to solve many of the problems that confront businesses today",100,100,NA
Angel Hernandez,Data Analyst,9/15/2017,"While the boot camp emphasizes welcoming all backgrounds in the field of data science, this is an excellent course for those data analysts who are looking to evolve their current skill set, to that of, a data scientist. During the duration of the boot camp, instructor(s) will provide intuitive and in-depth material on the theory and application of machine learning and data science. Then, you will have the opportunity to build on your understanding with in-class practice with predictive modeling, unsupervised learning and big data applications. All of this is being presented in a collaborative environment and with instructors who are experienced practitioners and excellent teachers. I came into the boot camp intimidated by the complexity of data science but left with new found confidence, knowing I now possess a comprehensive understanding of machine learning and eagly await to apply my new skill set in real-world applications.",100,100,100
Mirim,Analytics Manager,7/29/2017,"I just attended Data Science Bootcamp organized by Data Science Dojo in New York in July 2017. I wasn't sure how they would cover all the topics in the curriculum within 5 days. But to my surprise, they exceeded my expectation. I thoroughly enjoyed the intense, yet fun learning at the bootcamp. I am now well-equipped with data science terminologies and underlying statistical fundamentals. They provided good pre-bootcamp materials before the camp and had well-organized content during 5-day camp! Two instructors were very knowledgeable and generous with their times. Thank you!",100,100,NA
Stewart,BI MAnager,7/28/2017,I attended the NYC 2017 Data Science Dojo Bootcamp. Dave and Phuc are a great combination of deep data science and experience that work well together in a class setting. They turned what can be a very dry subject matter into a lively discussion and interesting for the students. Highly recommend this class but come prepared and definitely do the pre course work!!,100,100,100
Sultan Alharbi,N/A,7/28/2017,"I really loved the bootcamp, it's really interesting and exciting. Also, the delivery was absolutely perfect. I especially appreciated the knowledge and the material provided.",100,100,100
John C.,N/A,6/23/2017,"The bootcamp was wonderful and good for Data professionals at all levels. I can't believe how much content we were able to cover in 5 days. Also the class labs and homework were excellent and really helped me get a grasp on the key concepts. It was super exciting and satisfying to be able to build, tune, and deploy a machine learning model in a matter of days!",80,80,80
Darren Lacy,Principal Architect,5/26/2017,"Raja and the team at Data Science Dojo have put together an intensive 5-day bootcamp that can literally accelerate you from the starting line to practitioner in one week. As a long-time BI professional, I have attended many conferences and professional trainings during my career, and hyperbole aside, none were as effective or rewarding as the DSD bootcamp. Much will be demanded of you, between hours of pre-work, 10 hours per day of classroom instruction (minus lunch and a few breaks), homework, a capstone project, and follow-up activities if you so choose, but what a payout. As a classically trained musician in a former life, I know you get to Carnegie Hall by practice. The same is true of data science! As you gain a trove of knowledge throughout the class, the veil of mystery surrounding data science will be lifted before your eyes, and you will realize that you can do this. If you are serious about finally adding these skills to your arsenal, there is no better way than to attend the Data Science Dojo Bootcamp. Sign up today. That is all!",100,100,100
Anonymous,Analyst,5/12/2017,"I highly recommend this bootcamp to anybody who is trying to get a background in data science. I come from a statistical and research background and was considering going back to school to get a masters in data science to help me advance in this industry, but I believe I have gained all of the skills necessary to get me started (all in 5 DAYS!). Thank you David and Phuc!",100,100,100
Emanuele,Program Manager,4/27/2017,"If you are looking for a comprehensive course you can definitely approach Data Science Dojo. Delivering such a 'big data' of knowledge in 5 days can be done only if you know the product upside down. Well, DSD trainers proved to be up to the task with an excellent success rate!",100,100,NA
Anonymous,N/A,3/17/2017,"The course is very practical and you get a lot out of it. Comes with breaks in between and good food and snacks, and very kind instructors that are passionate and willing to help.",100,100,NA
Mac,Director of Analytics,3/16/2017,"The instructors, content and pace are terrific. I am experienced in data science and not only was the boot camp a nice hands-on refresher, it was an ability to focus for a week, build and improve models in R and deploy them in Azure and cover many other relevant topics along the way. I highly recommend this camp to anyone who wants to improve their skills or want to learn more about data science from highly qualified and effective instructors.",80,80,80
Allison,N/A,2/13/2017,"I really enjoyed this boot camp. The instructors are enthusiastic and the whole week is very interactive and hands on. I'm just beginning my data science journey, but I am excited to implement some of the skills learned in Austin this week.",100,100,NA
Jess,Lead Research Analyst,1/30/2017,"I've been back at work for a bit over a week since the bootcamp, and I can confidently say that this training was amazing, and it's made a huge difference in my ability to do my job. I come from a strong statistical and research-oriented background, but it's primarily academic in nature. So I wasn't super aware of machine learning, data science terminology, etc. But after a week, not only do I have a stronger understanding of data science ""lingo,"" but I also came away with a number of new methodologies that I can start implementing immediately. It was definitely an intense week (10 hour days plus homework), but the pace of the class was great, the length of each class wasn't terrible, and we were fed well and had a number of breaks. We were given just enough theory for us to understand the methods we were using, but it wasn't an overly-theoretical course. It got a bit math heavy there for a bit (regressions, I'm lookin at you), but I like math and didn't mind the calculus refresher. For those of you that aren't math-heavy, you'll get through it, just try and understand the general concepts. I loved that the course was R-centric - I've been trying to make the switch to R, and being forced to code in it for most of the week was great. My only complaint here is that we spent a lot of time doing some things in AzureML, and at the end of the day, I won't have access to that at work, so it would have been nice to spend at least a bit more time in R. The good thing is they provided lots and lots of documentation, so there is code hanging around for most of the things we did in Azure. I also loved the in-class kaggle competition. It was great to be able to use these tools and actually apply it to a real world dataset, and the competition aspect made it fun. Finally, I loved the approach that Raja and Phuc took with the course - just understand the basics to evaluating the model. Even if you can't detail all the differences between random forests and random jungles, you at least come away knowing how to evaluate those models and when to use them. Because of that approach, I came back to work with a ton of tools that I can begin using immediately. This is amazing, since a few other online courses I took really just left you with a bunch of tools that you have no idea what to do with once the class ends. Overall, I can't recommend this course highly enough. Not only was the course amazing, but I have a support network of likeminded folks (including the instructors) that I'm sure will be an invaluable resource as I move on.",100,100,NA
Anonymous,Junior Data Scientist,1/16/2017,"This was a very well organized bootcamp. I spent several months before the bootcamp taking various online courses, and it was difficult to know where to begin and where to focus my efforts: statistics, mathematics, machine learning, python, r, data engineering, etc. This bootcamp did an excellent job of cutting through the noise to get down to what is really important to get started in the field. They emphasize the fundamental concepts rather than the specific tools to use. During the bootcamp we used R, but we were provided the equivalent python code. The instructors took the time to answer all of our questions and made sure everyone understood the concepts before moving on. In addition, they offer support post-bootcamp and mentorship for a kaggle competition. Overall, I was very happy with the content and organization of the bootcamp. Many online courses teach the concepts well, but afterwords it is difficult to apply those concepts to real world scenarios and messy data. The Data Science Dojo bootcamp helped me bridge the gap between concept and reality. It is a tough week (8am-6pm M-F plus homework), but it is well worth it.",86,100,60
Anonymous,N/A,11/28/2016,"I joined the DSD bootcamp in Paris in November. I really enjoyed how the bootcamp was structured - getting for a feel for the data first through different types of visualisation and then building up our intuition for how the models work before writing the code. Then gradually building up complexity - introducing the pitfalls and then how they are currently solved. A good mix of theory and hands-on practice. We entered a Kaggle competition so we know how to do it now, and also employed the Internet of Things on the last day! I was also impressed by how the instructors took the time to answer all our questions and make sure that we all understood - given how 5 days isn't long! The support afterwards is also very good - I'm now working through a fat workbook! I funded my own place on the camp (i.e. my work didn't pay for it) and I was very happy with it.",86,100,60
Pier Lorenzo,"a software ""Generalist"" with a data science twist",11/21/2016,"I have just attended the DatascienceDojo bootcamp in Paris and, honestly, it has been amazing week: a wonderful adventure. The bootcamp is quite intense and demanding, it takes you through the different steps of the data science process building up the relevant knowledge to start working in on data science tasks. You learns how to leverage technologies, like the R ecosystem or the Azure ecosystem, to create data analysis and products. Lots of valuable insights, met wonderful and interesting people, good discussions and lots of fun. At the end of the bootcamp you feel completely energised and ready to continue the data science journey wherever it takes.",100,100,100
{
"cells": [
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "FrYi4yUsMmnt"
},
"outputs": [],
"source": [
"# $ python -m pip install beautifulsoup4"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "Q-YwFh4jMmn1"
},
"source": [
"#### Creating dataframe for review data"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "9CuQIZRtMmn4"
},
"outputs": [],
"source": [
"import requests\n",
"from bs4 import BeautifulSoup\n",
"\n",
"\n",
"URL = \"https://www.switchup.org/bootcamps/data-science-dojo?page=\"\n",
"page = requests.get(URL)\n",
"soup = BeautifulSoup(page.content, \"html.parser\")"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "HZXL2oPPMmn6"
},
"source": [
"#### Get number of pages for reviews by iterating through all review pages and building of list of review page urls."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "R4HpMZFhMmn7",
"outputId": "535e4b46-86a4-4e84-dad7-e6ab7455f667"
},
"outputs": [],
"source": [
"n=1\n",
"while True:\n",
" response = requests.get(URL + str(n), allow_redirects=False)\n",
" if response.status_code == 200:\n",
" n+=1\n",
" else:\n",
" break\n",
"\n",
"main_page_url_list = []\n",
"pages_to_scrape = n\n",
"i = 1\n",
"main_page_url_list = [URL + str(i) for i in range(i,n)]\n",
"print(main_page_url_list) # check"
]
},
{
"cell_type": "code",
"execution_count": 43,
"metadata": {
"id": "iS1OzlNvMmn-"
},
"outputs": [
{
"ename": "PermissionError",
"evalue": "[Errno 13] Permission denied: 'switchup_reviews.csv'",
"output_type": "error",
"traceback": [
"\u001b[1;31m---------------------------------------------------------------------------\u001b[0m",
"\u001b[1;31mPermissionError\u001b[0m Traceback (most recent call last)",
"\u001b[1;32m<ipython-input-43-91bcfe0a16bb>\u001b[0m in \u001b[0;36m<module>\u001b[1;34m\u001b[0m\n\u001b[0;32m 101\u001b[0m \u001b[0mswitchup_reviews\u001b[0m \u001b[1;33m=\u001b[0m \u001b[0mswitchup_reviews\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mtranspose\u001b[0m\u001b[1;33m(\u001b[0m\u001b[1;33m)\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 102\u001b[0m \u001b[0mswitchup_reviews\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mcolumns\u001b[0m\u001b[1;33m=\u001b[0m\u001b[1;33m[\u001b[0m\u001b[1;34m'Name'\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;34m'Job'\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;34m'Date of Review'\u001b[0m\u001b[1;33m,\u001b[0m \u001b[1;34m'Review'\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;34m'Overall Rating'\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;34m'Curriculum Rating'\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;34m'Job Support Rating'\u001b[0m\u001b[1;33m]\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[1;32m--> 103\u001b[1;33m \u001b[0mswitchup_reviews\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mto_csv\u001b[0m\u001b[1;33m(\u001b[0m\u001b[1;34m'switchup_reviews.csv'\u001b[0m\u001b[1;33m,\u001b[0m \u001b[0mindex\u001b[0m\u001b[1;33m=\u001b[0m\u001b[1;32mFalse\u001b[0m\u001b[1;33m)\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0m\u001b[0;32m 104\u001b[0m \u001b[0mi\u001b[0m\u001b[1;33m+=\u001b[0m\u001b[1;36m1\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n",
"\u001b[1;32mC:\\Python39\\lib\\site-packages\\pandas\\core\\generic.py\u001b[0m in \u001b[0;36mto_csv\u001b[1;34m(self, path_or_buf, sep, na_rep, float_format, columns, header, index, index_label, mode, encoding, compression, quoting, quotechar, line_terminator, chunksize, date_format, doublequote, escapechar, decimal, errors, storage_options)\u001b[0m\n\u001b[0;32m 3385\u001b[0m )\n\u001b[0;32m 3386\u001b[0m \u001b[1;33m\u001b[0m\u001b[0m\n\u001b[1;32m-> 3387\u001b[1;33m return DataFrameRenderer(formatter).to_csv(\n\u001b[0m\u001b[0;32m 3388\u001b[0m \u001b[0mpath_or_buf\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 3389\u001b[0m \u001b[0mline_terminator\u001b[0m\u001b[1;33m=\u001b[0m\u001b[0mline_terminator\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n",
"\u001b[1;32mC:\\Python39\\lib\\site-packages\\pandas\\io\\formats\\format.py\u001b[0m in \u001b[0;36mto_csv\u001b[1;34m(self, path_or_buf, encoding, sep, columns, index_label, mode, compression, quoting, quotechar, line_terminator, chunksize, date_format, doublequote, escapechar, errors, storage_options)\u001b[0m\n\u001b[0;32m 1081\u001b[0m \u001b[0mformatter\u001b[0m\u001b[1;33m=\u001b[0m\u001b[0mself\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mfmt\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 1082\u001b[0m )\n\u001b[1;32m-> 1083\u001b[1;33m \u001b[0mcsv_formatter\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0msave\u001b[0m\u001b[1;33m(\u001b[0m\u001b[1;33m)\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0m\u001b[0;32m 1084\u001b[0m \u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 1085\u001b[0m \u001b[1;32mif\u001b[0m \u001b[0mcreated_buffer\u001b[0m\u001b[1;33m:\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n",
"\u001b[1;32mC:\\Python39\\lib\\site-packages\\pandas\\io\\formats\\csvs.py\u001b[0m in \u001b[0;36msave\u001b[1;34m(self)\u001b[0m\n\u001b[0;32m 226\u001b[0m \"\"\"\n\u001b[0;32m 227\u001b[0m \u001b[1;31m# apply compression and byte/text conversion\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[1;32m--> 228\u001b[1;33m with get_handle(\n\u001b[0m\u001b[0;32m 229\u001b[0m \u001b[0mself\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mfilepath_or_buffer\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 230\u001b[0m \u001b[0mself\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mmode\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n",
"\u001b[1;32mC:\\Python39\\lib\\site-packages\\pandas\\io\\common.py\u001b[0m in \u001b[0;36mget_handle\u001b[1;34m(path_or_buf, mode, encoding, compression, memory_map, is_text, errors, storage_options)\u001b[0m\n\u001b[0;32m 640\u001b[0m \u001b[0merrors\u001b[0m \u001b[1;33m=\u001b[0m \u001b[1;34m\"replace\"\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 641\u001b[0m \u001b[1;31m# Encoding\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[1;32m--> 642\u001b[1;33m handle = open(\n\u001b[0m\u001b[0;32m 643\u001b[0m \u001b[0mhandle\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n\u001b[0;32m 644\u001b[0m \u001b[0mioargs\u001b[0m\u001b[1;33m.\u001b[0m\u001b[0mmode\u001b[0m\u001b[1;33m,\u001b[0m\u001b[1;33m\u001b[0m\u001b[1;33m\u001b[0m\u001b[0m\n",
"\u001b[1;31mPermissionError\u001b[0m: [Errno 13] Permission denied: 'switchup_reviews.csv'"
]
}
],
"source": [
"import pandas as pd\n",
"all_reviews = []\n",
" \n",
"name = []\n",
"job = []\n",
"date = []\n",
"review = []\n",
"overall_rating = []\n",
"curriculum_rating = []\n",
"job_support_rating = []\n",
"switchup_reviews = []\n",
"review = []\n",
"\n",
"for x, _ in enumerate(range(1,n)):\n",
"\n",
" url = main_page_url_list[x]\n",
" page = requests.get(url)\n",
" soup = BeautifulSoup(page.content, \"html.parser\")\n",
" pretty_soup = soup.prettify()\n",
" results = soup.find(id=\"reviews\")\n",
" job_elements = results.find_all(\"div\", class_=\"section--white border-grey mdc-layout-grid\") # find all job elements\n",
"\n",
" table=soup.find_all(\"div\",class_=\"mdc-layout-grid__inner border-bottom border-bottom--grey\")\n",
" name_and_job = soup.find_all(\"li\", class_=\"name\")\n",
" review_created_date = soup.find_all(\"p\", class_=\"unset-margin__top subtitle\")\n",
" review_text_trunc = soup.find_all(\"div\", class_=\"review-description\")\n",
" ovr_cirr_job_rating = soup.find_all(\"div\", class_=\"rating-icons__filled\")\n",
"\n",
" review_list = [x.text.replace('Read More','') for x in review_text_trunc[:]]\n",
" review_list = [x.replace('Read Less','') for x in review_list]\n",
" review_list = [x.replace('...','') for x in review_list]\n",
" review_list = [x for x in review_list[:]]\n",
" \n",
" datelist = [x.text for x in review_created_date]\n",
"\n",
" star_rating_list = []\n",
" star_rating_list = [x['style'] for x in ovr_cirr_job_rating]\n",
" star_rating_list = [x.replace('width:','') for x in star_rating_list]\n",
" star_rating_list = [x[0:3] for x in star_rating_list]\n",
" star_rating_list = [x.replace('.','') for x in star_rating_list]\n",
" star_rating_list = [x for x in star_rating_list[2:-7]]\n",
" star_rating_list = [star_rating_list[i:i+3] for i in range(0, len(star_rating_list), 3)]\n",
"\n",
"\n",
" for starrating in star_rating_list:\n",
" overall_rating.append(starrating[0])\n",
" try:\n",
" curriculum_rating.append(starrating[1])\n",
" except:\n",
" curriculum_rating.append('NA')\n",
" try:\n",
" job_support_rating.append(starrating[2])\n",
" except:\n",
" job_support_rating.append('NA')\n",
" # if len(starrating)==3:\n",
" # job_support_rating.append(starrating[2])\n",
" # else:\n",
" # job_support_rating.append('NA')\n",
"\n",
" for i in curriculum_rating:\n",
" if i == '00':\n",
" curriculum_rating[curriculum_rating.index(i)] = 'NA'\n",
" for i in overall_rating:\n",
" if i == '00':\n",
" overall_rating[overall_rating.index(i)] = 'NA'\n",
" for i in job_support_rating:\n",
" if i == '00':\n",
" job_support_rating[job_support_rating.index(i)] = 'NA'\n",
"\n",
" if len(table)==0:\n",
" name_and_job = soup.find_all(\"li\", class_=\"name\")\n",
" # Pull name and job from the list\n",
" nameJob = name_and_job[2]\n",
" for nameJob in name_and_job:\n",
" try:\n",
" names = nameJob.find('span').text[0:-1].strip()\n",
" name.append(names)\n",
" except:\n",
" name = \"N/A\"\n",
" try:\n",
" jobs = nameJob.contents[3].text.strip()\n",
" jobs = jobs[2:].strip()\n",
" job.append(jobs)\n",
" except:\n",
" jobs = \"N/A\"\n",
" job.append(jobs)\n",
" else:\n",
" for row in table:\n",
" name.append(row.find(\"h6\", class_=\"unset-margin__top unset-margin__bottom\").text)\n",
" graduated_and_job=row.find_all(\"span\", class_=\"subtitle\")\n",
" if len(graduated_and_job)==2:\n",
" job.append(graduated_and_job[0].text.split(' |')[0])\n",
" else:\n",
" job.append(\"N/A\")\n",
"\n",
" date.extend(datelist)\n",
" review.extend(review_list)\n",
"\n",
" i=0\n",
" switchup_reviews = pd.DataFrame([name, job, date, review ,overall_rating,curriculum_rating,job_support_rating])\n",
" switchup_reviews = switchup_reviews.transpose()\n",
" switchup_reviews.columns=['Name','Job','Date of Review', 'Review','Overall Rating','Curriculum Rating','Job Support Rating']\n",
" switchup_reviews.to_csv('switchup_reviews.csv', index=False)\n",
" i+=1\n"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "0OWPHUYFMmoA"
},
"outputs": [],
"source": [
"results = soup.find(id=\"reviews\")\n",
"job_elements = results.find_all(\"div\", class_=\"section--white border-grey mdc-layout-grid\")\n",
"\n",
"with open(\"switchup_reviews2.txt\", \"w+\") as j:\n",
" j.write(str(job_elements))\n",
" j.close"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "UF_9MBBRMmoB"
},
"source": [
"#### Variables to be gathered"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "GFFmS_lFMmoD",
"outputId": "4b85390a-3a2b-4339-f57b-348981f8c906"
},
"outputs": [],
"source": [
"name = []\n",
"job = []\n",
"table=soup.find_all(\"div\",class_=\"mdc-layout-grid__inner border-bottom border-bottom--grey\")\n",
"if len(table)==0:\n",
" name_and_job = soup.find_all(\"li\", class_=\"name\")\n",
" # Pull name and job from the list\n",
" nameJob = name_and_job[2]\n",
" for nameJob in name_and_job:\n",
" try:\n",
" names = nameJob.find('span').text[0:-1].strip()\n",
" name.append(names)\n",
" except:\n",
" name = \"N/A\"\n",
" try:\n",
" jobs = nameJob.contents[3].text.strip()\n",
" jobs = jobs[2:].strip()\n",
" job.append(jobs)\n",
" except:\n",
" jobs = \"N/A\"\n",
" job.append(jobs)\n",
"else:\n",
" for row in table:\n",
" name.append(row.find(\"h6\", class_=\"unset-margin__top unset-margin__bottom\").text)\n",
" graduated_and_job=row.find_all(\"span\", class_=\"subtitle\")\n",
" if len(graduated_and_job)==2:\n",
" job.append(graduated_and_job[0].text.split(' |')[0])\n",
" else:\n",
" job.append(\"N/A\")\n",
"print(name)\n",
"print(job)\n",
"review_created_date = soup.find_all(\"p\", class_=\"unset-margin__top subtitle\")\n",
"review_text_trunc = soup.find_all(\"span\", class_=\"truncatable\")\n",
"ovr_cirr_job_rating = soup.find_all(\"div\", class_=\"rating-icons__filled\")"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "5FfzkuLJMmoH"
},
"source": [
"#### Create a list of dates"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "2JwIIhadMmoI",
"outputId": "94f2aa9f-ecff-4664-dfa9-e1ee7eadc4d5"
},
"outputs": [],
"source": [
"datelist = [x.text for x in review_created_date]\n",
"print(datelist)\n",
"print(len(datelist))"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "SCJj5h6VMmoJ"
},
"source": [
"#### Gather Reviews"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "6RmBASnZMmoJ",
"outputId": "2907d340-bdad-4a5c-fca0-3d8db04c52de"
},
"outputs": [],
"source": [
"review_list = [x.text.replace('Read More','') for x in review_text_trunc[1:]]\n",
"review_list = [x.replace('Read Less','') for x in review_list]\n",
"review_list = [x.replace('...','') for x in review_list]\n",
"print(review_list)\n",
"print(len(review_list))"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "lI7nRc19MmoK",
"outputId": "27f45f13-a192-4a16-ba5b-1075c7627d67"
},
"outputs": [],
"source": [
"#NOTE First 4 positions in list are for DSD - 0 = overall, 1 = overall (average), 2 = Curriculum, 3 = job support ||\n",
"#NOTE Currently the last 7 positions are for ads - but that may change - basically anything not divisible by 20 is either DSD overall or an ad. \n",
"#NOTE Switchup only let's people give whole stars. Divide by 20 to get number of stars.\n",
"overall_rating = []\n",
"curriculum_rating = []\n",
"job_support_rating = []\n",
"star_rating_list = [x['style'] for x in ovr_cirr_job_rating]\n",
"star_rating_list = [x.replace('width:','') for x in star_rating_list]\n",
"star_rating_list = [x[0:3] for x in star_rating_list]\n",
"star_rating_list = [x.replace('.','') for x in star_rating_list]\n",
"star_rating_list = [x for x in star_rating_list if x in {'100', '80', '60', '40', '20', '00'}]\n",
"star_rating_list = [star_rating_list[i:i+3] for i in range(0, len(star_rating_list), 3)]\n",
"print(star_rating_list)\n",
"print(len(star_rating_list))\n",
"\n",
"for starrating in star_rating_list:\n",
" overall_rating.append(starrating[0])\n",
" curriculum_rating.append(starrating[1])\n",
" if len(starrating)==3:\n",
" job_support_rating.append(starrating[2])\n",
" else:\n",
" job_support_rating.append('NA')\n",
"for i in curriculum_rating:\n",
" if i == '00':\n",
" curriculum_rating[curriculum_rating.index(i)] = 'NA'\n",
"for i in overall_rating:\n",
" if i == '00':\n",
" overall_rating[overall_rating.index(i)] = 'NA'\n",
"for i in job_support_rating:\n",
" if i == '00':\n",
" job_support_rating[job_support_rating.index(i)] = 'NA'\n",
"print(overall_rating)\n",
"print(curriculum_rating)\n",
"print(job_support_rating)"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "I4gCDNycMmoM"
},
"source": [
"Add each list"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 310
},
"id": "7mr4gM07MmoN",
"outputId": "8a0a10f7-657e-437f-f68a-d3f5539fb214"
},
"outputs": [],
"source": [
"import pandas as pd\n",
"\n",
"switchup_reviews = pd.DataFrame([name, job, datelist, review_list,overall_rating,curriculum_rating,job_support_rating])\n",
"switchup_reviews = switchup_reviews.transpose()\n",
"switchup_reviews.columns=['Name','Job','Date of Review', 'Review','Overall Rating','Curriculum Rating','Job Support Rating']\n",
"switchup_reviews.head()\n"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "bgvBSam4MmoN"
},
"source": [
"#### Saving"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"id": "dSI01rNlMmoO"
},
"outputs": [],
"source": [
"switchup_reviews.to_csv('switchup_reviews.csv', index=False)"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "8Np2hXKvMmoP"
},
"source": [
"# Extra code from other methods"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "dtYADOQTMmoP",
"outputId": "6a3a190b-b587-48a8-bfd0-e4376c2a5a84"
},
"outputs": [],
"source": [
"with open(\"Review page 1.txt\", \"w+\") as f:\n",
" for job_element in job_elements:\n",
" name_and_job = job_element.find(\"li\", class_=\"name\")\n",
" review_created_date = job_element.find(\"div\", class_=\"created-at\")\n",
" review_text_trunc = job_element.find(\"span\", class_=\"truncatable\")\n",
" review_text_more = job_element.find(\"span\", class_=\"read-more\")\n",
" ovr_cirr_job_rating = job_element.find(\"div\", class_=\"rating-icons__filled\")\n",
" line_item = list(\n",
" [\n",
" name_and_job,\n",
" review_created_date,\n",
" review_text_trunc,\n",
" review_text_more,\n",
" ovr_cirr_job_rating,\n",
" ]\n",
" )\n",
" f.write(str(line_item))\n",
" f.write(\"\\n\"*2)\n",
"\n",
"j.close"
]
},
{
"cell_type": "markdown",
"metadata": {
"id": "29BRXnoyMmoQ"
},
"source": [
"#### Extra code"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "IhuL9TNNMmoQ",
"outputId": "22dbe11c-ec17-4ed5-e004-83c08a6db316"
},
"outputs": [],
"source": [
"import pandas as pd\n",
"from html.parser import HTMLParser\n",
"\n",
"# Create table parser and extract table data\n",
"class Parser(HTMLParser):\n",
" def __init__(self):\n",
" HTMLParser.__init__(self)\n",
" self.in_td = False\n",
" \n",
" def handle_starttag(self, tag, attrs):\n",
" if tag in ['span', 'p', 'div']:\n",
" self.in_td = True\n",
" \n",
" def handle_data(self, data):\n",
" if self.in_td:\n",
" if data == '\\xa0':\n",
" list.append('') # This is a placeholder for the draft round will fill in later\n",
" else:\n",
" list.append(data)\n",
" \n",
" def handle_endtag(self, tag):\n",
" self.in_td = False\n",
"\n",
"list = []\n",
"data = open('Review page 2.txt', 'r')\n",
"data = data.read()\n",
"p = Parser()\n",
"p.feed(str(data))\n",
"\n",
"print(list)\n",
"\n",
"reviewDetails = [list[x:x+6] for x in range(0, len(list), 6)]\n",
"print(reviewDetails[0])\n",
"print(reviewDetails[1])\n",
"print(reviewDetails[2])\n",
"print(reviewDetails[3])\n"
]
}
],
"metadata": {
"colab": {
"collapsed_sections": [],
"name": "switchup.ipynb",
"provenance": []
},
"interpreter": {
"hash": "63fd5069d213b44bf678585dea6b12cceca9941eaf7f819626cde1f2670de90d"
},
"kernelspec": {
"display_name": "Python 3.9.2 64-bit",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.2"
},
"orig_nbformat": 4
},
"nbformat": 4,
"nbformat_minor": 2
}
{ {
"nbformat": 4, "nbformat": 4,
"nbformat_minor": 0, "nbformat_minor": 2,
"metadata": { "metadata": {
"orig_nbformat": 4, "orig_nbformat": 4,
"language_info": { "language_info": {
...@@ -31,207 +31,202 @@ ...@@ -31,207 +31,202 @@
"cells": [ "cells": [
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"id": "FrYi4yUsMmnt"
},
"source": [ "source": [
"# $ python -m pip install beautifulsoup4" "# $ python -m pip install beautifulsoup4"
], ],
"execution_count": null, "outputs": [],
"outputs": [] "metadata": {
"id": "FrYi4yUsMmnt"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "Q-YwFh4jMmn1"
},
"source": [ "source": [
"#### Creating dataframe for review data" "#### Creating dataframe for review data"
] ],
"metadata": {
"id": "Q-YwFh4jMmn1"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": 1,
"id": "9CuQIZRtMmn4"
},
"source": [ "source": [
"import requests\n", "import requests\r\n",
"from bs4 import BeautifulSoup\n", "from bs4 import BeautifulSoup\r\n",
"\n", "\r\n",
"\n", "\r\n",
"URL = \"https://www.switchup.org/bootcamps/data-science-dojo?page=1\"\n", "URL = \"https://www.switchup.org/bootcamps/data-science-dojo?page=\"\r\n",
"page = requests.get(URL)\n", "page = requests.get(URL)\r\n",
"soup = BeautifulSoup(page.content, \"html.parser\")" "soup = BeautifulSoup(page.content, \"html.parser\")"
], ],
"execution_count": 3, "outputs": [],
"outputs": [] "metadata": {
"id": "9CuQIZRtMmn4"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "HZXL2oPPMmn6"
},
"source": [ "source": [
"#### Get number of pages for reviews by iterating through all review pages and building of list of review page urls." "#### Get number of pages for reviews by iterating through all review pages and building of list of review page urls."
] ],
"metadata": {
"id": "HZXL2oPPMmn6"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": 2,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "R4HpMZFhMmn7",
"outputId": "535e4b46-86a4-4e84-dad7-e6ab7455f667"
},
"source": [ "source": [
"n=1\n", "n=1\r\n",
"while True:\n", "while True:\r\n",
" response = requests.get(URL + str(n), allow_redirects=False)\n", " response = requests.get(URL + str(n), allow_redirects=False)\r\n",
" if response.status_code == 200:\n", " if response.status_code == 200:\r\n",
" n+=1\n", " n+=1\r\n",
" else:\n", " else:\r\n",
" break\n", " break\r\n",
"\n", "\r\n",
"main_page_url_list = []\n", "main_page_url_list = []\r\n",
"pages_to_scrape = n\n", "pages_to_scrape = n\r\n",
"i = 1\n", "i = 1\r\n",
"main_page_url_list = [URL + str(i) for i in range(i,n)]\n", "main_page_url_list = [URL + str(i) for i in range(i,n)]\r\n",
"print(main_page_url_list) # check" "print(main_page_url_list) # check"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
"name": "stdout", "name": "stdout",
"text": [ "text": [
"['https://www.switchup.org/bootcamps/data-science-dojo?page=11', 'https://www.switchup.org/bootcamps/data-science-dojo?page=12']\n" "['https://www.switchup.org/bootcamps/data-science-dojo?page=1', 'https://www.switchup.org/bootcamps/data-science-dojo?page=2', 'https://www.switchup.org/bootcamps/data-science-dojo?page=3', 'https://www.switchup.org/bootcamps/data-science-dojo?page=4', 'https://www.switchup.org/bootcamps/data-science-dojo?page=5', 'https://www.switchup.org/bootcamps/data-science-dojo?page=6', 'https://www.switchup.org/bootcamps/data-science-dojo?page=7', 'https://www.switchup.org/bootcamps/data-science-dojo?page=8', 'https://www.switchup.org/bootcamps/data-science-dojo?page=9', 'https://www.switchup.org/bootcamps/data-science-dojo?page=10', 'https://www.switchup.org/bootcamps/data-science-dojo?page=11', 'https://www.switchup.org/bootcamps/data-science-dojo?page=12']\n"
] ]
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "R4HpMZFhMmn7",
"outputId": "535e4b46-86a4-4e84-dad7-e6ab7455f667"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": 3,
"id": "iS1OzlNvMmn-"
},
"source": [ "source": [
"# for x, _ in enumerate(range(1,n)):\n", "for x, _ in enumerate(range(1,n)):\r\n",
"# url = main_page_url_list[]\n", " url = main_page_url_list[x]\r\n",
"page = requests.get('https://www.switchup.org/bootcamps/data-science-dojo?page=2')\n", " page = requests.get(url)\r\n",
"soup = BeautifulSoup(page.content, \"html.parser\")\n", " soup = BeautifulSoup(page.content, \"html.parser\")\r\n",
"pretty_soup = soup.prettify()\n", " pretty_soup = soup.prettify()\r\n",
"with open(\"Review page \" + str(2) + \".txt\", \"w\", encoding=\"utf-8\") as f:\n", " with open(\"Review page \" + str(x) + \".txt\", \"w\", encoding=\"utf-8\") as f:\r\n",
" f.write(str(pretty_soup))" " f.write(str(pretty_soup))\r\n",
" results = soup.find(id=\"reviews\")\r\n",
" job_elements = results.find_all(\"div\", class_=\"section--white border-grey mdc-layout-grid\") # find all job elements"
], ],
"execution_count": 12, "outputs": [],
"outputs": [] "metadata": {
"id": "iS1OzlNvMmn-"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"id": "0OWPHUYFMmoA"
},
"source": [ "source": [
"results = soup.find(id=\"reviews\")\n", "results = soup.find(id=\"reviews\")\r\n",
"job_elements = results.find_all(\"div\", class_=\"section--white border-grey mdc-layout-grid\")\n", "job_elements = results.find_all(\"div\", class_=\"section--white border-grey mdc-layout-grid\")\r\n",
"\n", "\r\n",
"with open(\"switchup_reviews2.txt\", \"w+\") as j:\n", "with open(\"switchup_reviews2.txt\", \"w+\") as j:\r\n",
" j.write(str(job_elements))\n", " j.write(str(job_elements))\r\n",
" j.close" " j.close"
], ],
"execution_count": null, "outputs": [],
"outputs": [] "metadata": {
"id": "0OWPHUYFMmoA"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "UF_9MBBRMmoB"
},
"source": [ "source": [
"#### Variables to be gathered" "#### Variables to be gathered"
] ],
"metadata": {
"id": "UF_9MBBRMmoB"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": 4,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "GFFmS_lFMmoD",
"outputId": "4b85390a-3a2b-4339-f57b-348981f8c906"
},
"source": [ "source": [
"name = []\n", "name = []\r\n",
"job = []\n", "job = []\r\n",
"table=soup.find_all(\"div\",class_=\"mdc-layout-grid__inner border-bottom border-bottom--grey\")\n", "table=soup.find_all(\"div\",class_=\"mdc-layout-grid__inner border-bottom border-bottom--grey\")\r\n",
"if len(table)==0:\n", "if len(table)==0:\r\n",
" name_and_job = soup.find_all(\"li\", class_=\"name\")\n", " name_and_job = soup.find_all(\"li\", class_=\"name\")\r\n",
" # Pull name and job from the list\n", " # Pull name and job from the list\r\n",
" nameJob = name_and_job[2]\n", " nameJob = name_and_job[2]\r\n",
" for nameJob in name_and_job:\n", " for nameJob in name_and_job:\r\n",
" try:\n", " try:\r\n",
" names = nameJob.find('span').text[0:-1].strip()\n", " names = nameJob.find('span').text[0:-1].strip()\r\n",
" name.append(names)\n", " name.append(names)\r\n",
" except:\n", " except:\r\n",
" name = \"N/A\"\n", " name = \"N/A\"\r\n",
" try:\n", " try:\r\n",
" jobs = nameJob.contents[3].text.strip()\n", " jobs = nameJob.contents[3].text.strip()\r\n",
" jobs = jobs[2:].strip()\n", " jobs = jobs[2:].strip()\r\n",
" job.append(jobs)\n", " job.append(jobs)\r\n",
" except:\n", " except:\r\n",
" jobs = \"N/A\"\n", " jobs = \"N/A\"\r\n",
" job.append(jobs)\n", " job.append(jobs)\r\n",
"else:\n", "else:\r\n",
" for row in table:\n", " for row in table:\r\n",
" name.append(row.find(\"h6\", class_=\"unset-margin__top unset-margin__bottom\").text)\n", " name.append(row.find(\"h6\", class_=\"unset-margin__top unset-margin__bottom\").text)\r\n",
" graduated_and_job=row.find_all(\"span\", class_=\"subtitle\")\n", " graduated_and_job=row.find_all(\"span\", class_=\"subtitle\")\r\n",
" if len(graduated_and_job)==2:\n", " if len(graduated_and_job)==2:\r\n",
" job.append(graduated_and_job[0].text.split(' |')[0])\n", " job.append(graduated_and_job[0].text.split(' |')[0])\r\n",
" else:\n", " else:\r\n",
" job.append(\"N/A\")\n", " job.append(\"N/A\")\r\n",
"print(name)\n", "print(name)\r\n",
"print(job)\n", "print(job)\r\n",
"review_created_date = soup.find_all(\"p\", class_=\"unset-margin__top subtitle\")\n", "review_created_date = soup.find_all(\"p\", class_=\"unset-margin__top subtitle\")\r\n",
"review_text_trunc = soup.find_all(\"span\", class_=\"truncatable\")\n", "review_text_trunc = soup.find_all(\"span\", class_=\"truncatable\")\r\n",
"ovr_cirr_job_rating = soup.find_all(\"div\", class_=\"rating-icons__filled\")" "ovr_cirr_job_rating = soup.find_all(\"div\", class_=\"rating-icons__filled\")"
], ],
"execution_count": 13,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
"name": "stdout", "name": "stdout",
"text": [ "text": [
"['S. Dali', 'Javier', 'Anastasia', 'Ottmar', 'Gabriel Buehler', 'Anonymous', 'Anonymous', 'Steve Rusev', 'Miles Pomeroy', 'Yulia Dorofeyuk']\n", "['John Farrell', 'Anonymous', 'Kevin Hillaker', 'Jaclyn', 'Anshu', 'David', 'JC', 'Saurabh Agrawal', 'Guilherme', 'Ryan M.']\n",
"['Data Scientist', 'Economist', 'Portfolio Manager', 'N/A', 'Sr Account Executive', 'N/A', 'N/A', 'N/A', 'N/A', 'N/A']\n" "['Director of Technology', 'Data Engineer', 'Data Engineer', 'Data Analyst', 'N/A', 'Laboratory Informatics Consultant', 'Senior Data Engineer', 'Software Engineer IV', 'N/A', 'Trader']\n"
] ]
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "GFFmS_lFMmoD",
"outputId": "4b85390a-3a2b-4339-f57b-348981f8c906"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "5FfzkuLJMmoH"
},
"source": [ "source": [
"#### Create a list of dates" "#### Create a list of dates"
] ],
"metadata": {
"id": "5FfzkuLJMmoH"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": 6,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "2JwIIhadMmoI",
"outputId": "94f2aa9f-ecff-4664-dfa9-e1ee7eadc4d5"
},
"source": [ "source": [
"datelist = [x.text for x in review_created_date]\n", "datelist = [x.text for x in review_created_date]\r\n",
"print(datelist)\n", "print(datelist)\r\n",
"print(len(datelist))" "print(len(datelist))"
], ],
"execution_count": 6,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
...@@ -241,26 +236,27 @@ ...@@ -241,26 +236,27 @@
"10\n" "10\n"
] ]
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "2JwIIhadMmoI",
"outputId": "94f2aa9f-ecff-4664-dfa9-e1ee7eadc4d5"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "SCJj5h6VMmoJ"
},
"source": [ "source": [
"#### Gather Reviews" "#### Gather Reviews"
] ],
"metadata": {
"id": "SCJj5h6VMmoJ"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "6RmBASnZMmoJ",
"outputId": "2907d340-bdad-4a5c-fca0-3d8db04c52de"
},
"source": [ "source": [
"review_list = [x.text.replace('Read More','') for x in review_text_trunc[1:]]\n", "review_list = [x.text.replace('Read More','') for x in review_text_trunc[1:]]\n",
"review_list = [x.replace('Read Less','') for x in review_list]\n", "review_list = [x.replace('Read Less','') for x in review_list]\n",
...@@ -268,7 +264,6 @@ ...@@ -268,7 +264,6 @@
"print(review_list)\n", "print(review_list)\n",
"print(len(review_list))" "print(len(review_list))"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
...@@ -278,17 +273,18 @@ ...@@ -278,17 +273,18 @@
"9\n" "9\n"
] ]
} }
] ],
},
{
"cell_type": "code",
"metadata": { "metadata": {
"colab": { "colab": {
"base_uri": "https://localhost:8080/" "base_uri": "https://localhost:8080/"
}, },
"id": "lI7nRc19MmoK", "id": "6RmBASnZMmoJ",
"outputId": "27f45f13-a192-4a16-ba5b-1075c7627d67" "outputId": "2907d340-bdad-4a5c-fca0-3d8db04c52de"
}
}, },
{
"cell_type": "code",
"execution_count": null,
"source": [ "source": [
"#NOTE First 4 positions in list are for DSD - 0 = overall, 1 = overall (average), 2 = Curriculum, 3 = job support ||\n", "#NOTE First 4 positions in list are for DSD - 0 = overall, 1 = overall (average), 2 = Curriculum, 3 = job support ||\n",
"#NOTE Currently the last 7 positions are for ads - but that may change - basically anything not divisible by 20 is either DSD overall or an ad. \n", "#NOTE Currently the last 7 positions are for ads - but that may change - basically anything not divisible by 20 is either DSD overall or an ad. \n",
...@@ -325,7 +321,6 @@ ...@@ -325,7 +321,6 @@
"print(curriculum_rating)\n", "print(curriculum_rating)\n",
"print(job_support_rating)" "print(job_support_rating)"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
...@@ -338,27 +333,27 @@ ...@@ -338,27 +333,27 @@
"['100', 'NA', 'NA', '100', '100', '100', 'NA', 'NA', 'NA', 'NA']\n" "['100', 'NA', 'NA', '100', '100', '100', 'NA', 'NA', 'NA', 'NA']\n"
] ]
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "lI7nRc19MmoK",
"outputId": "27f45f13-a192-4a16-ba5b-1075c7627d67"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "I4gCDNycMmoM"
},
"source": [ "source": [
"Add each list" "Add each list"
] ],
"metadata": {
"id": "I4gCDNycMmoM"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"colab": {
"base_uri": "https://localhost:8080/",
"height": 310
},
"id": "7mr4gM07MmoN",
"outputId": "8a0a10f7-657e-437f-f68a-d3f5539fb214"
},
"source": [ "source": [
"import pandas as pd\n", "import pandas as pd\n",
"\n", "\n",
...@@ -367,7 +362,6 @@ ...@@ -367,7 +362,6 @@
"switchup_reviews.columns=['Name','Job','Date of Review', 'Review','Overall Rating','Curriculum Rating','Job Support Rating']\n", "switchup_reviews.columns=['Name','Job','Date of Review', 'Review','Overall Rating','Curriculum Rating','Job Support Rating']\n",
"switchup_reviews.head()\n" "switchup_reviews.head()\n"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "execute_result", "output_type": "execute_result",
...@@ -469,46 +463,48 @@ ...@@ -469,46 +463,48 @@
"metadata": {}, "metadata": {},
"execution_count": 10 "execution_count": 10
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/",
"height": 310
},
"id": "7mr4gM07MmoN",
"outputId": "8a0a10f7-657e-437f-f68a-d3f5539fb214"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "bgvBSam4MmoN"
},
"source": [ "source": [
"#### Saving" "#### Saving"
] ],
"metadata": {
"id": "bgvBSam4MmoN"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"id": "dSI01rNlMmoO"
},
"source": [ "source": [
"switchup_reviews.to_csv('switchup_reviews.csv', index=False)" "switchup_reviews.to_csv('switchup_reviews.csv', index=False)"
], ],
"execution_count": null, "outputs": [],
"outputs": [] "metadata": {
"id": "dSI01rNlMmoO"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "8Np2hXKvMmoP"
},
"source": [ "source": [
"# Extra code from other methods" "# Extra code from other methods"
] ],
"metadata": {
"id": "8Np2hXKvMmoP"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "dtYADOQTMmoP",
"outputId": "6a3a190b-b587-48a8-bfd0-e4376c2a5a84"
},
"source": [ "source": [
"with open(\"Review page 1.txt\", \"w+\") as f:\n", "with open(\"Review page 1.txt\", \"w+\") as f:\n",
" for job_element in job_elements:\n", " for job_element in job_elements:\n",
...@@ -531,7 +527,6 @@ ...@@ -531,7 +527,6 @@
"\n", "\n",
"j.close" "j.close"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "execute_result", "output_type": "execute_result",
...@@ -543,26 +538,27 @@ ...@@ -543,26 +538,27 @@
"metadata": {}, "metadata": {},
"execution_count": 12 "execution_count": 12
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "dtYADOQTMmoP",
"outputId": "6a3a190b-b587-48a8-bfd0-e4376c2a5a84"
}
}, },
{ {
"cell_type": "markdown", "cell_type": "markdown",
"metadata": {
"id": "29BRXnoyMmoQ"
},
"source": [ "source": [
"#### Extra code" "#### Extra code"
] ],
"metadata": {
"id": "29BRXnoyMmoQ"
}
}, },
{ {
"cell_type": "code", "cell_type": "code",
"metadata": { "execution_count": null,
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "IhuL9TNNMmoQ",
"outputId": "22dbe11c-ec17-4ed5-e004-83c08a6db316"
},
"source": [ "source": [
"import pandas as pd\n", "import pandas as pd\n",
"from html.parser import HTMLParser\n", "from html.parser import HTMLParser\n",
...@@ -601,7 +597,6 @@ ...@@ -601,7 +597,6 @@
"print(reviewDetails[2])\n", "print(reviewDetails[2])\n",
"print(reviewDetails[3])\n" "print(reviewDetails[3])\n"
], ],
"execution_count": null,
"outputs": [ "outputs": [
{ {
"output_type": "stream", "output_type": "stream",
...@@ -614,7 +609,14 @@ ...@@ -614,7 +609,14 @@
"['\\n ', '\\n ', '\\n ', '\\n ', '\\n Not Sure Where to Apply?\\n ', '\\n ']\n" "['\\n ', '\\n ', '\\n ', '\\n ', '\\n Not Sure Where to Apply?\\n ', '\\n ']\n"
] ]
} }
] ],
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "IhuL9TNNMmoQ",
"outputId": "22dbe11c-ec17-4ed5-e004-83c08a6db316"
}
} }
] ]
} }
\ No newline at end of file
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment