Abstract: Understanding the structural layout of PDF documents is a core component of information extraction and intelligent document analysis. However, due to the lack of explicit semantic ...