Python binding to approach0 It includes the operator tree parser that approach0 uses, and a linear tokenizer for latex, of course, it has core functionalities of the approach0 system. The tokenizer is useful if you want to apply it for transformer because it can reduce the vocabulary (e.g., \frac vs \dfrac) at the same time treating the tokens specially.
A fork of the Tangent-CFT search engine (Python) that recognizes all LaTeX operators in arXMLiv 2019 and M-SE. Can be used to produce SLT and OPT out of MathML.