3-million-Sudoku-puzzles-with-ratings
OpenML dataset with id 43476
Author name not available (Why is that?)
Full work available at URL: https://api.openml.org/data/v1/download/22102301/3-million-Sudoku-puzzles-with-ratings.arff
Upload date: 23 March 2022
Dataset Characteristics
Number of classes: 0
Number of features: 4 (numeric: 2, symbolic: 0 and in total binary: 0 )
Number of instances: 3,000,000
Number of instances with missing values: 0
Number of missing values: 0
Overview
This dataset contains 3 million Sudoku puzzles and their solutions. The level of difficulty varies -- some can be solved easily by a beginner, while others will challenge experienced solvers. Most puzzles have between 23 and 26 clues. The minimum number of clues in the dataset is 19, and the maximum is 31. It has been shown that 17 is the minimum number of clues for a valid, uniquely solvable Sudoku puzzle. However, these puzzles are difficult to find, so they are not included in our dataset.
Each row of the dataset includes the number of clues and an estimated difficulty rating. The difficulty rating is computed by an automated solver and it is based on the average search tree depth over 10 attempts. 43 of the puzzles have a difficulty of zero, meaning that it can be solved using a simple scanning technique. The highest difficulty rating is 8.5.
The puzzles were generated using Blagovest Dachev's Sudoku generator and solver, at https://github.com/dachev/sudoku.
This page was built for dataset: 3-million-Sudoku-puzzles-with-ratings