Abstract
With the increasing development of academic research, thesis writing and publication have become important ways to measure scientific research achievements. However, non-standard thesis formats not only affect the reading experience, but may also lead to a decrease in the evaluation of academic achievements. Therefore, this article proposes a rule engine based automatic paper format detection system, aiming to improve the efficiency and accuracy of thesis format review and reduce manual review costs. This system is built on the basis of the Office Open XML document specification, which formulates a series of format checking rules for the basic format requirements of thesis. Then, the rule engine conducts in-depth checks on the thesis based on the preset format specification, automatically identifies format problems that do not comply with the specification, and outputs a detection report. It provides modification suggestions to assist users in quickly correcting format errors.