PDF_parser is a bake-off framework for comparing PDF parsers on scientific and biomedical PDFs, especially review papers. The goal is to evaluate which parser's output is most suitable for downstream LLM ...