LaTeXML à la carte
07 December, 2023

An invited talk for AIM Cyber Infrastructure Workshop, covering HTML for arXiv, ar5iv, LaTeXML and MathML.

Welcome to ar5iv! Wrestling with the Open Problems of Scholarly Writing
22 September, 2022

An invited talk for CICM 2022, announcing the newly created ar5iv platform. ar5iv is an arXiv Labs project, attempting to provide an HTML5 conversion of all arXiv articles with LaTeX sources, via LaTeXML.
  CICM 2022 ,   ar5iv ,   arXiv ,   LaTeXML

ar5iv.org — a preview site for the arXMLiv dataset
07 February, 2022

Launching a preview installation for 1.797 million arXiv preprints, in HTML5. Goal: reintegrate into arXiv.org.
  AITP16 ,   Math NLP ,   Corpora ,   arXiv ,   LaTeXML

Encyclopedic Intent: A proposal for fully accessible MathML narration
21 November, 2021

A proposal on principles and design approach for MathML Intent.

Scientific Statement Classification over arXiv.org
12 April, 2021

The scientific statements annotated in papers from arXiv.org as a classification task. Baselines, ablations and live showcase.
  AITP16 ,   Math NLP ,   Corpora ,   arXiv ,   LaTeXML

Language and Mathematics Model Pretraining in 2021
05 December, 2020

Research overview on large-scale natural language processing on math-rich scientific documents, as carried out by the KWARC research group.
  AITP16 ,   Math NLP ,   Corpora ,   arXiv ,   LaTeXML

Math-rich NLP on Billion Token Corpora
05 April, 2016

Research overview on large-scale natural language processing on math-rich scientific documents, as carried out by the KWARC research group.
  AITP16 ,   Math NLP ,   Corpora ,   arXiv ,   LaTeXML

Live Mathematics on Authorea
13 July, 2015

A case for transparency in science, exemplified via a showcase of active mathematical features on Authorea.com
  MathUI ,   Live Mathematics ,   Authorea ,   Showcase ,   Flot ,   D3JS ,   iPython ,   LaTeXML ,   Pandoc