Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences
Paper
•
2508.03542
•
Published
•
5
Multimodal generative AI, VLM, image generation, video generation, mechanical interpretability, embodied AI