Coursera-Course-Dataset
OpenML dataset with id 43381
Author name not available
Full work available at URL: https://api.openml.org/data/v1/download/22102206/Coursera-Course-Dataset.arff
Upload date: 23 March 2022
Dataset Characteristics
Number of features: 7 (numeric: 2, symbolic: 0, binary: 0)
Number of instances: 891
Number of instances with missing values: 0
Number of missing values: 0
Context
I generated this dataset during a hackathon, for use in a project. The data was scraped from the official Coursera website. Our project aims to help any new learner find the right course by answering just a few questions; it is an intelligent course recommendation system. To build it, we had to scrape data from a few educational websites, and this dataset is the part scraped from Coursera. I have just started learning data science and hope this dataset will be helpful to someone for their own purposes. Please show your support by following us.
Project: https://github.com/Siddharth1698/Coursu
Scraping code: https://github.com/Siddharth1698/Coursera-Course-Dataset
Article about the dataset generation: https://medium.com/analytics-vidhya/web-scraping-and-coursera-8db6af45d83f
Content
This dataset mainly contains 6 columns and data on 890 courses. The detailed description:
course_title: the title of the course.
course_organization: the organization that conducts the course.
courseCertificatetype: the type of certification available for the course.
course_rating: the rating associated with the course.
course_difficulty: the difficulty level of the course.
coursestudentsenrolled: the number of students enrolled in the course.
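As a rough illustration of how the columns described above could feed a course recommender, the following minimal Python sketch filters rows by difficulty and ranks them by rating. The sample rows, the difficulty labels ("Beginner", "Intermediate"), and the helper name top_rated are all assumptions made up for this example; they are not taken from the dataset or from the project's code.

```python
from typing import Dict, List, Union

Row = Dict[str, Union[str, float]]

# Placeholder rows invented for illustration only; not real dataset entries.
# The difficulty labels used here are assumed, not confirmed by the source.
SAMPLE_ROWS: List[Row] = [
    {"course_title": "Example Course A", "course_organization": "Example University",
     "course_rating": 4.5, "course_difficulty": "Beginner"},
    {"course_title": "Example Course B", "course_organization": "Example Institute",
     "course_rating": 4.8, "course_difficulty": "Intermediate"},
    {"course_title": "Example Course C", "course_organization": "Example University",
     "course_rating": 4.9, "course_difficulty": "Beginner"},
]


def top_rated(rows: List[Row], difficulty: str, limit: int = 3) -> List[Row]:
    """Return up to `limit` rows with the given difficulty, highest rating first."""
    matches = [row for row in rows if row["course_difficulty"] == difficulty]
    return sorted(matches, key=lambda row: row["course_rating"], reverse=True)[:limit]


if __name__ == "__main__":
    for row in top_rated(SAMPLE_ROWS, "Beginner"):
        print(row["course_title"], row["course_rating"])
```

A real recommender would likely combine more signals than these two columns; this only shows the shape of the data and one simple way to query it.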
Inspiration
This is one of my first scraped datasets. Follow my GitHub for more: https://github.com/Siddharth1698