Sudoku is a popular number placement puzzle. It is played on a 9 by 9 grid, and the objective is to fill each cell with a digit from 1 to 9 such that each column, each row and each 3 by 3 sub-grid (called a block) contains every digit from 1 to 9. The puzzle starts with some cells pre-filled and the player has to fill the rest to reach the solution. Since each unit (column, row or block) has 9 cells and must contain all 9 digits, there cannot be any duplicates in a unit.

{% img https://upload.wikimedia.org/wikipedia/commons/thumb/f/ff/Sudoku-by-L2G-20050714.svg/250px-Sudoku-by-L2G-20050714.svg.png 200 A typical Sudoku puzzle %} {% img https://upload.wikimedia.org/wikipedia/commons/thumb/3/31/Sudoku-by-L2G-20050714_solution.svg/250px-Sudoku-by-L2G-20050714_solution.svg.png 200 The same puzzle with solution numbers marked in red %}

In this post we look at how to solve a Sudoku with Haskell.

The code in this post depends on the split package from Hackage.

Basic setup

{-# LANGUAGE BangPatterns, RecordWildCards #-}

module Sudoku where

import qualified Data.Set as S
import qualified Data.Map as M
import Data.Char (digitToInt, intToDigit)
import Data.List (foldl', intersperse, intercalate, sortBy, nub)
import Data.List.Split (chunksOf)
import Data.Maybe (fromJust)
import Data.Ord (comparing)
import Control.Monad (foldM, guard)

Now that the imports are out of the way, let's set up the basic functionality.

data Digit = ONE | TWO | TRE | FOR | FIV | SIX | SVN | EGT | NIN
             deriving (Eq, Ord, Enum)

instance Show Digit where
  show digit = show $ fromEnum digit + 1

allDigits = S.fromList [ONE .. NIN]

data Cell = Cell { cellIdx :: Int, cellVals :: S.Set Digit }
            deriving (Eq, Ord)

instance Show Cell where
  show Cell{..} = "<" ++ show cellIdx ++ " " ++ show (S.toList cellVals) ++">"

newtype Board = Board (M.Map Int Cell)
                deriving (Eq, Ord)

boardCells :: Board -> [Cell]
boardCells (Board ixMap) = M.elems ixMap

cellAt :: Board -> Int -> Maybe Cell
cellAt (Board ixMap) idx = M.lookup idx ixMap

updateBoard :: Board -> Cell -> Board
updateBoard (Board ixMap) cell@Cell{..} = Board (M.insert cellIdx cell ixMap)

emptyBoard :: Board
emptyBoard =
  Board $ foldl' (\m i -> M.insert i (Cell i allDigits) m) M.empty [0 .. 80]

A Digit is just one of the nine possible values. It derives Eq, Ord and Enum. We use the fact that Digit is enumerable to create a custom Show instance which is just the Digit's ordinal plus one, so that show ONE gives "1".

A Cell has an index cellIdx and a set of possible digit values cellVals. The cell index is a number from 0 to 80 inclusive. The cell values are the values the cell can hold without violating the rules of Sudoku. If a cell is filled, it holds only one value. We create a custom Show instance of Cell to pretty print it.

A Board is just a wrapper over a map from the cell index to the corresponding cell. We use a map instead of a simple list of Cells for faster lookups.

boardCells, cellAt and updateBoard are convenience functions to manipulate a board. boardCells returns a list of all the cells in a board, cellAt returns the cell at a given index and updateBoard updates a given cell in a board.

emptyBoard creates an empty board with all the cells unfilled. It does so by folding over the list of indexes and inserting, for each index, a cell with all possible digits into the map.
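Since a cell is identified only by its index, it can help to see how an index maps back to grid coordinates. The following is a minimal sketch, assuming the indexes are laid out row first, left to right (which is how the board is read later in this post). The names cellRow, cellCol and cellBlock are hypothetical helpers and are not part of the Sudoku module.

```haskell
-- Hypothetical helpers (not part of the post's module): map a cell index
-- in [0 .. 80] to its row, column and block, all numbered from 0.
-- Assumes indexes run row first, left to right.
cellRow, cellCol, cellBlock :: Int -> Int
cellRow idx = idx `div` 9
cellCol idx = idx `mod` 9
-- Blocks are numbered 0 .. 8, row first, left to right.
cellBlock idx = (cellRow idx `div` 3) * 3 + cellCol idx `div` 3
```

For instance, the center cell has index 40, and cellRow 40, cellCol 40 and cellBlock 40 all evaluate to 4.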

Reading and printing the Sudoku

Next let's write some functions to read a Sudoku board from a string and to print a board so that we can start playing with actual examples. The board is represented as a single line with one digit for each filled cell and a dot (.) for each empty one. The cells are read row first, left to right. An example appears in the ghci run later in this section.

readBoard :: String -> Maybe Board
readBoard str = do
  guard $ length str == 81
  foldM (\board (i, chr) -> do
            guard $ chr == '.' || (chr `S.member` S.fromList ['1' .. '9'])
            let cellVals = if chr == '.'
                             then allDigits
                             else S.singleton $ toEnum $ digitToInt chr - 1
            return $ updateBoard board (Cell i cellVals))
        $ zip [0 .. 80 ] str

readBoard converts a string to a Board. It returns Just Board if the string represents a valid Sudoku board, otherwise it returns Nothing. We use the Monad instance of Maybe to guard against the possible failures. The guards fail if the string length is not exactly 81 or if it contains characters other than the digits 1 to 9 and the dot (.). A cell in the returned board has exactly one cell value if the string contained a digit at that cell index; otherwise it has all the digits as cell values.
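The guard-based validation can be seen in isolation in the following minimal sketch. It works on plain Ints instead of the Digit and Cell types, and the names parseCellChar and parseBoardChars are hypothetical, introduced only for illustration.

```haskell
import Control.Monad (guard)
import Data.Char (digitToInt, isDigit)

-- Hypothetical helper: parse one board character.
--   Nothing        -> invalid character
--   Just Nothing   -> empty cell ('.')
--   Just (Just d)  -> filled cell holding digit d in 1 .. 9
parseCellChar :: Char -> Maybe (Maybe Int)
parseCellChar '.' = Just Nothing
parseCellChar chr = do
  -- guard short-circuits the whole do block to Nothing when it fails
  guard $ isDigit chr && chr /= '0'
  return . Just $ digitToInt chr

-- Hypothetical helper: validate the length, then every character.
-- mapM in the Maybe monad yields Nothing as soon as one character fails.
parseBoardChars :: String -> Maybe [Maybe Int]
parseBoardChars str = do
  guard $ length str == 81
  mapM parseCellChar str
```

Here parseCellChar 'x' and parseBoardChars "12" both give Nothing, one because of an invalid character and the other because of the length check.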

showBoard :: Board -> String
showBoard =
  map (\Cell{..} ->
          if S.size cellVals == 1
            then intToDigit . (+ 1) . fromEnum . head . S.toList $ cellVals
            else '.')
  . boardCells

asciiShowBoard :: Board -> String
asciiShowBoard =
  (\t -> border ++ "\n" ++ t ++ border ++ "\n")
  . unlines . intercalate [border] . chunksOf 3
  . map ((\r -> "| " ++ r ++ " |")
         . intercalate " | " . map (intersperse ' ') . chunksOf 3)
  . chunksOf 9
  . showBoard
  where border = "+-------+-------+-------+"

instance Show Board where
  show = showBoard

showBoard does the reverse of readBoard. It takes a board and creates a valid string representation of it. It does so by mapping over the cells of the board in index order and outputting the digit if the cell is filled, or a dot (.) otherwise.

asciiShowBoard converts a board to an ASCII graphic Sudoku board like the ones we are used to seeing. It does so by taking the output of showBoard, breaking it into chunks corresponding to rows and blocks, inserting spaces and | at appropriate places and then joining them with borders made of -.
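To make the row formatting step concrete, here is a minimal sketch of it in isolation. The names chunks and formatRow are ours for illustration and are not part of the Sudoku module. chunks is a local stand-in for chunksOf so that the snippet needs only base, and it assumes a positive chunk size.

```haskell
import Data.List (intercalate, intersperse)

-- Local stand-in for Data.List.Split.chunksOf (assumes n > 0).
chunks :: Int -> [a] -> [[a]]
chunks _ [] = []
chunks n xs = let (h, t) = splitAt n xs in h : chunks n t

-- Format one 9-character row the way asciiShowBoard does:
-- split into three groups, space out the characters of each group,
-- join the groups with " | " and wrap the result in "| " and " |".
formatRow :: String -> String
formatRow = (\r -> "| " ++ r ++ " |")
          . intercalate " | " . map (intersperse ' ') . chunks 3
```

With this, formatRow "6..3.2..." evaluates to "| 6 . . | 3 . 2 | . . . |".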

Lastly, we add a Show instance of Board using showBoard.

Here is an example run in ghci:

*Sudoku> let boardStr = "6..3.2....4.....1..........7.26............543.........8.15........4.2........7.."
*Sudoku> let (Just board) = readBoard boardStr
*Sudoku> showBoard board
"6..3.2....4.....1..........7.26............543.........8.15........4.2........7.."
*Sudoku> putStr (asciiShowBoard board)
+-------+-------+-------+
| 6 . . | 3 . 2 | . . . |
| . 4 . | . . . | . 1 . |
| . . . | . . . | . . . |
+-------+-------+-------+
| 7 . 2 | 6 . . | . . . |
| . . . | . . . | . 5 4 |
| 3 . . | . . . | . . . |
+-------+-------+-------+
| . 8 . | 1 5 . | . . . |
| . . . | . 4 . | 2 . . |
| . . . | . . . | 7 . . |
+-------+-------+-------+

Is it solved yet?

Before we proceed to write a full-fledged Sudoku solver, we must have a function which tells us whether a board is filled completely and whether that solution is a valid one.

data BoardState = SOLVED | INCOMPLETE | INVALID
                  deriving (Eq, Show)

boardState :: Board -> BoardState
boardState board
  | any (\Cell{..} -> S.size cellVals /= 1) $ boardCells board = INCOMPLETE
  | any isUnitInvalid units = INVALID
  | otherwise = SOLVED
  where
    isUnitInvalid unitCells =
      (S.fromList . map (head . S.toList . cellVals) $ unitCells) /= allDigits

    units = map (map (fromJust . cellAt board)) unitIxs

unitIxs   = rowIxs ++ columnIxs ++ blockIxs
rowIxs    = map (\i -> [i * 9 .. i * 9 + 8]) [0..8]
columnIxs = map (\i -> take 9 [i, i + 9 ..]) [0..8]
blockIxs  =
  concatMap (\(row1:row2:row3:_) ->
                zipWith3 (\blockRow1 blockRow2 blockRow3 ->
                             blockRow1 ++ blockRow2 ++ blockRow3)
                         row1 row2 row3)
  . chunksOf 3 . map (chunksOf 3) $ rowIxs

We start by defining the board state as an enumeration of three values corresponding to the solved, incomplete and invalid states. The boardState function takes a board and gives its current state. It does so by checking three conditions:

  1. if any cell in the board does not have only one possible value then the board is incomplete
  2. if any unit of the board is invalid then the solution is invalid
  3. else the board is solved

To find if a unit is invalid, we take all the cells of the unit and check whether they hold all nine digits between them, as the rules of Sudoku require.

Units are found just by looking up the indexes from the board for each unit. Unit indexes are all the row, column and block indexes taken together. Row and column indexes can be obtained with simple arithmetic. Block indexes are a little trickier to get. We take the row indexes, split each row into chunks of three columns, then take three rows at a time and concatenate the results of a function which zips the three rows together, creating the block indexes.
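The unit check boils down to a set comparison. Here is a minimal sketch of the same idea on plain Ints rather than the Digit type. The name isUnitValid is hypothetical and is not part of the Sudoku module.

```haskell
import qualified Data.Set as S

-- Hypothetical simplified check: a filled unit is valid exactly when its
-- nine values, taken as a set, are the digits 1 .. 9. A duplicate digit
-- shrinks the set, so the comparison fails.
isUnitValid :: [Int] -> Bool
isUnitValid vals = length vals == 9 && S.fromList vals == S.fromList [1 .. 9]
```

A unit such as [1,8,3,9,2,1,6,5,7] fails this check because the repeated 1 leaves no room for a 4.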

A run in ghci shows the indexes to be correct:

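*Sudoku> mapM_ print rowIxs
[0,1,2,3,4,5,6,7,8]
[9,10,11,12,13,14,15,16,17]
[18,19,20,21,22,23,24,25,26]
[27,28,29,30,31,32,33,34,35]
[36,37,38,39,40,41,42,43,44]
[45,46,47,48,49,50,51,52,53]
[54,55,56,57,58,59,60,61,62]
[63,64,65,66,67,68,69,70,71]
[72,73,74,75,76,77,78,79,80]

*Sudoku> mapM_ print columnIxs
[0,9,18,27,36,45,54,63,72]
[1,10,19,28,37,46,55,64,73]
[2,11,20,29,38,47,56,65,74]
[3,12,21,30,39,48,57,66,75]
[4,13,22,31,40,49,58,67,76]
[5,14,23,32,41,50,59,68,77]
[6,15,24,33,42,51,60,69,78]
[7,16,25,34,43,52,61,70,79]
[8,17,26,35,44,53,62,71,80]

*Sudoku> mapM_ print blockIxs
[0,1,2,9,10,11,18,19,20]
[3,4,5,12,13,14,21,22,23]
[6,7,8,15,16,17,24,25,26]
[27,28,29,36,37,38,45,46,47]
[30,31,32,39,40,41,48,49,50]
[33,34,35,42,43,44,51,52,53]
[54,55,56,63,64,65,72,73,74]
[57,58,59,66,67,68,75,76,77]
[60,61,62,69,70,71,78,79,80]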

See how the row indexes follow the grid indexes as we have taken our grid indexes to be rows first, left to right. If we take row indexes column-wise we get the column indexes. If we take the row indexes block-wise we get the block indexes.
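For comparison, the block indexes can also be written with arithmetic alone. This is a sketch of an alternative formulation, not the code this post uses. The names blockIxsAlt and coversGrid are hypothetical.

```haskell
import Data.List (sort)

-- Hypothetical alternative to blockIxs: for each block, identified by its
-- block row and block column (0 .. 2 each), enumerate the three rows and
-- three columns it spans and compute row * 9 + col for each cell.
blockIxsAlt :: [[Int]]
blockIxsAlt =
  [ [ row * 9 + col | row <- [blockRow * 3 .. blockRow * 3 + 2]
                    , col <- [blockCol * 3 .. blockCol * 3 + 2] ]
  | blockRow <- [0 .. 2], blockCol <- [0 .. 2] ]

-- Sanity check: the nine blocks together cover every index exactly once.
coversGrid :: Bool
coversGrid = sort (concat blockIxsAlt) == [0 .. 80]
```

The first block comes out as [0,1,2,9,10,11,18,19,20], the same cells as the top-left block of the grid.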

Let's do a few sample runs of boardState in ghci:

*Sudoku> let (Just board) = readBoard "483921657967345821251876493548132976729564138136798245372689514814253769695417382"
*Sudoku> putStr (asciiShowBoard board)
+-------+-------+-------+
| 4 8 3 | 9 2 1 | 6 5 7 |
| 9 6 7 | 3 4 5 | 8 2 1 |
| 2 5 1 | 8 7 6 | 4 9 3 |
+-------+-------+-------+
| 5 4 8 | 1 3 2 | 9 7 6 |
| 7 2 9 | 5 6 4 | 1 3 8 |
| 1 3 6 | 7 9 8 | 2 4 5 |
+-------+-------+-------+
| 3 7 2 | 6 8 9 | 5 1 4 |
| 8 1 4 | 2 5 3 | 7 6 9 |
| 6 9 5 | 4 1 7 | 3 8 2 |
+-------+-------+-------+
*Sudoku> boardState board
SOLVED
*Sudoku> let (Just board) = readBoard "48392165796734582125187649354813297672956413.136798245372689514814253769695417382"
*Sudoku> boardState board
INCOMPLETE
*Sudoku> let (Just board) = readBoard "183921657967345821251876493548132976729564138136798245372689514814253769695417382"
*Sudoku> boardState board
INVALID

That seems to be working. Now let's move on to actually solving the Sudoku!

One way to solve Sudoku is to think of it as a graph search problem. Each board configuration becomes a node in the search graph with the moves linking them as edges. A move is filling a particular cell with a digit. So now we can solve the board just by finding a path from the given board configuration to a configuration where all cells are filled.

We can use the depth-first search (DFS) algorithm to accomplish this. DFS is a brute-force technique and in the worst case it may visit every node in the search graph. For Sudoku this graph is very large (there are approximately 6.67×10^21 valid completed grids alone), so this is not a very efficient way of solving it. For now, we'll add one optimization to DFS: while listing the next possible configurations for a particular configuration, we start with the cell with the smallest number of cell values. This does not help in the worst case but it generally speeds things up a little. We can write this in two parts: a general DFS function and a solver which uses it to solve a Sudoku.

dfs :: Ord a => a -> (a -> [a]) -> (a -> Bool) -> [a]
dfs start getNext isGoal = go start S.empty
  where
    go node visited
      | isGoal node = [node]
      | S.member node visited = []
      | otherwise = concatMap (\nextNode ->
                               go nextNode (S.insert node visited))
                              (getNext node)

dfsSolver :: Board -> [Board]
dfsSolver board = dfs board nextBoards ((== SOLVED) . boardState)
  where
    nextBoards board =
      map (updateBoard board)
      . concatMap (\Cell{..} -> map (Cell cellIdx . S.singleton) . S.toList $ cellVals)
      . sortBy (comparing (S.size . cellVals))
      . filter ((/= 1) . S.size . cellVals)
      . boardCells
      $ board

The dfs function is a literal translation of the DFS algorithm. It visits the graph node by node, keeping track of the nodes visited and the goals seen so far. It first checks if the current node is a goal and if so it just returns it wrapped in a list. If the current node is not a goal, it checks if it has already been visited and if so just returns an empty list. This is done to avoid getting stuck in infinite loops, going down the same path again and again. If both of these checks fail then it gets to the general case, in which it finds all the next nodes for the current node, recursively maps itself over them, accumulating all the goals found (by flattening the lists), and returns them.

dfsSolver uses dfs to solve Sudoku by supplying the getNext and isGoal functions. The isGoal function is simple: it just checks if the current board is solved by calling the boardState function. nextBoards is a little complicated so let's go over it step by step, reading from bottom to top:

  1. it gets all the cells in the board
  2. it keeps only the empty cells, that is, the cells with more than one cell value
  3. it sorts the empty cells by the count of their cell values
  4. for each cell, it takes each cell value and creates a cell containing only that cell value, then flattens this list of lists of cells into a list of cells
  5. for each cell so created, it creates a board by updating the current board with that cell

So in effect, it goes over all the empty cells in the board in ascending order of their cell value count, picking each cell value in turn and creating a board where that cell is filled with that cell value. Hence it outputs all the next board configurations for the current board.
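The ordering of moves can be seen in a simplified model. The sketch below represents a cell as a pair of an index and a list of candidate digits, which is an assumption made to keep the example self-contained. SimpleCell and nextMoves are hypothetical names and are not part of the Sudoku module.

```haskell
import Data.List (sortBy)
import Data.Ord (comparing)

-- Hypothetical simplified model: a cell is (index, candidate digits).
type SimpleCell = (Int, [Int])

-- List the moves (index, digit) in the order nextBoards would try them:
-- drop filled cells, put the cells with the fewest candidates first
-- (sortBy is stable, so ties keep their index order), then expand each
-- cell into one move per candidate.
nextMoves :: [SimpleCell] -> [(Int, Int)]
nextMoves =
  concatMap (\(idx, vals) -> map (\v -> (idx, v)) vals)
  . sortBy (comparing (length . snd))
  . filter ((/= 1) . length . snd)
```

For the cells [(0,[1,2,3]),(1,[5]),(2,[4,7])] this gives [(2,4),(2,7),(0,1),(0,2),(0,3)]: cell 1 is already filled and is skipped, and cell 2 is tried before cell 0 because it has fewer candidates.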

And then dfs goes to work: it goes over the whole graph and finds all the solutions. Note that since a solution is reachable by more than one path (fill cell 1 first and then cell 2, or do it in reverse order), the solutions returned are in general not unique. Also, because of the concatMap in dfs, it finds the solutions one by one in a lazy fashion. So it is possible to stop the search early and just get the first solution found.
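Both effects, duplicate goals and lazy early stopping, show up on a toy graph as well. The sketch below repeats dfs so that it is self-contained and runs it on a made-up graph over Ints. steps, allGoals, firstGoal and uniqueGoals are hypothetical names, and using listToMaybe and a Set for deduplication are our substitutions for the head and nub used elsewhere in this post.

```haskell
import qualified Data.Set as S
import Data.Maybe (listToMaybe)

-- dfs as defined in this post.
dfs :: Ord a => a -> (a -> [a]) -> (a -> Bool) -> [a]
dfs start getNext isGoal = go start S.empty
  where
    go node visited
      | isGoal node = [node]
      | S.member node visited = []
      | otherwise = concatMap (\nextNode ->
                               go nextNode (S.insert node visited))
                              (getNext node)

-- Hypothetical toy graph: from n we may step to n + 1 or n * 2,
-- as long as we stay within the limit.
steps :: Int -> Int -> [Int]
steps limit n = filter (<= limit) [n + 1, n * 2]

-- All goals found: one copy of 10 per distinct path from 1 to 10.
allGoals :: [Int]
allGoals = dfs 1 (steps 10) (== 10)

-- Laziness lets us stop at the first goal; listToMaybe also covers
-- the case where there is no goal at all, unlike head.
firstGoal :: Maybe Int
firstGoal = listToMaybe allGoals

-- Deduplicate through a Set (needs Ord, returns sorted order).
uniqueGoals :: [Int]
uniqueGoals = S.toList (S.fromList allGoals)
```

Here allGoals contains 10 several times because many paths lead to it, while firstGoal is Just 10 and uniqueGoals is [10].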

Let's try this out now in ghci:

*Sudoku> let boardStr = ".839216579.734582125187649354813297672956413813679824537268951481425376969541738."
*Sudoku> let (Just board) = readBoard boardStr
*Sudoku> :set +s
*Sudoku> (mapM_ print . nub . dfsSolver) board
483921657967345821251876493548132976729564138136798245372689514814253769695417382
(1.75 secs, 356010984 bytes)
*Sudoku> let boardStr = ".839216579.734582125187.49354813297672956413813679824537268951481425376969541738."
*Sudoku> let (Just board) = readBoard boardStr
*Sudoku> :set +s
*Sudoku> (mapM_ print . nub . dfsSolver) board
483921657967345821251876493548132976729564138136798245372689514814253769695417382
(66.64 secs, 13136374896 bytes)
*Sudoku> (print . head . dfsSolver) board
483921657967345821251876493548132976729564138136798245372689514814253769695417382
(7.32 secs, 1322834576 bytes)

And it solves the Sudoku as expected! Note how we have to use nub to find the unique solutions. Also note how the time taken increases from 1.75 secs for 3 empty cells to 66.64 secs for 4 empty cells, indicating the exponential nature of the problem graph and the brute-force nature of the solver. The first solution in the case of 4 empty cells is, however, found in just 7.32 secs by using head to stop the search early.

So we have now successfully written our first Sudoku solver. Too bad it can't be used for solving boards with more than a few empty cells. Let's see a way to improve the solver drastically in the next section.

Constraint propagation

Get the code

This post can be downloaded as a compilable Literate Haskell file here. The Haskell code in the post can be downloaded here or can be forked here.