Abstract: This paper introduces a hybrid method for solving Sudoku puzzles by combining Deep Q-Learning (DQN) with heuristic backtracking. Sudoku is treated as a Constraint Satisfaction Problem (CSP), ...
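To make the CSP framing concrete, the following is a minimal sketch of the backtracking side only, under stated assumptions. The visible text does not specify which heuristic the paper uses, so a minimum-remaining-values (MRV) cell ordering is assumed here purely for illustration. The names `candidates`, `select_cell`, and `solve` are hypothetical, and the DQN component is not modeled.

```python
from typing import List, Optional, Set, Tuple

Grid = List[List[int]]  # 9x9 grid, 0 marks an empty cell


def candidates(grid: Grid, r: int, c: int) -> Set[int]:
    """Digits that violate no row, column, or 3x3 box constraint at (r, c)."""
    used = set(grid[r]) | {grid[i][c] for i in range(9)}
    br, bc = 3 * (r // 3), 3 * (c // 3)
    used |= {grid[i][j] for i in range(br, br + 3) for j in range(bc, bc + 3)}
    return set(range(1, 10)) - used


def select_cell(grid: Grid) -> Optional[Tuple[int, int]]:
    """Assumed MRV heuristic: pick the empty cell with the fewest candidates."""
    empties = [(r, c) for r in range(9) for c in range(9) if grid[r][c] == 0]
    if not empties:
        return None
    return min(empties, key=lambda rc: len(candidates(grid, *rc)))


def solve(grid: Grid) -> bool:
    """Fill the grid in place; return True if a complete assignment exists."""
    cell = select_cell(grid)
    if cell is None:
        return True  # no empty cells left, every constraint is satisfied
    r, c = cell
    for digit in sorted(candidates(grid, r, c)):
        grid[r][c] = digit
        if solve(grid):
            return True
        grid[r][c] = 0  # undo the assignment and try the next digit
    return False
```

In a hybrid scheme of the kind the abstract describes, a learned policy could plausibly replace or rank the choices made in `select_cell` or the digit ordering in `solve`; how the paper actually couples the two components is not stated in the text shown here.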