5.0 · 500+ projects delivered

arXiv to BibTeX Converter

Free. No signup. Runs in your browser.

Paste an arXiv ID, get clean BibTeX. The eprint, archivePrefix, and primaryClass fields come out the way biblatex expects, citekeys read like vaswani2017attention, and acronyms in titles stay capitalised – ready to drop into your .bib without breaking the compile.

How to convert arXiv IDs to a BibTeX file online

Visit the arXiv to BibTeX tool page You’re already here. Bookmark thelatexlab.com/arxiv-to-bibtex/ for next time.
Step 1
Paste your arXiv ID One ID like 1706.03762, or up to 50 – one per line – for batch conversion. Old-style IDs like math/0506203 and hep-th/9901001 work too. If you paste 1706.03762v2, the version suffix is kept.
Step 2
Pick the output dialect BibLaTeX preserves UTF-8 for biber. Legacy BibTeX rewrites accents as LaTeX macros (Kr{\"a}mer) for older bibtex setups. Default is BibLaTeX.
Step 3
Convert, then copy or download Each entry renders with its inferred type and a per-row entry-type override. Use the per-row Copy button, or Copy all / Download .bib at the bottom for the full batch.
Step 4
Lookups go through our small cPanel passthrough at thelatexlab.com/tools/arxiv-proxy.php because arXiv’s API doesn’t send CORS headers for browser-direct fetches. The proxy caches by ID hash and rate-limits by IP. Bibliographic content is never logged or stored. Entries that fail to resolve show up inline with the reason so you can fix or drop them without losing the rest of the batch.

Where most arXiv converters break

The eprint field gets a doubled “arXiv:” prefix Most converters emit eprint = {arXiv:1706.03762}. Under biblatex, the \eprint macro then prepends “arXiv:” itself at render time, so the rendered reference reads arXiv:arXiv:1706.03762. The acmart class hit this exact bug (issue #382). The fix is to emit the bare ID and put the prefix in archivePrefix instead.
3 fields eprint + archivePrefix + primaryClass
Citekeys are the raw arXiv ID arxiv2bibtex.org returns @misc{1706.03762, ...}. Numeric citekeys break alpha.bst labels, make \cite{...} unreadable in the source, and turn search-and-replace across a .bib into a numeric game. vaswani2017attention is the convention every reference manager defaults to for a reason.
vaswani2017attention lastnameYearWord, stable
Acronyms in titles get lowercased arXiv ML titles are full of RNN, LSTM, BERT, GAN. Under plain, abbrv, IEEEtran, and acmart bibstyles, an unprotected title gets lowercased: “BERT for question answering” becomes “Bert for question answering”. The portable fix is to wrap all-caps tokens in braces: {BERT}, {RNN}. Inline math like $O(n \log n)$ is detected and passed through unescaped – if you’ve ever debugged a “Missing $ inserted” failure from a copy-pasted title, this is the upstream cause.
{RNN} applied at conversion
@misc is wrong once the paper is published Many ML arXiv papers end up at NeurIPS, ICLR, or ACL, but the arXiv record stays a preprint. Bibstyles render @misc with no venue and no pages, which reads poorly once a conference version exists. The tool defaults to @misc for true preprints and exposes a per-row entry-type override so you can switch a single entry to @inproceedings or @article without leaving the page. If the paper also has a journal DOI, /doi-to-bibtex/ is usually the cleaner path.
Override per-result dropdown
When arXiv has registered a DOI for the preprint (an opt-in metadata path introduced in 2022), it shows up in the BibTeX as a doi field alongside the eprint fields. Citekeys are lastnameYearWord with collision handling. CJK / Arabic / Cyrillic author names pass through as UTF-8 in both dialects – there’s no LaTeX accent-macro equivalent, so they need a Unicode-aware bibstyle or biber regardless.

Frequently asked questions

arXiv’s own guidance is to include the version so a reader sees the exact text you read. If you paste 1706.03762v2, the version suffix is preserved in the eprint field. If you paste 1706.03762 bare, the resolver returns metadata for the latest version and the eprint is written without a version suffix – which arXiv treats as “always the most recent revision”. Pick whichever matches what you actually read.

If a peer-reviewed journal version exists and you’ve read it, cite that – it has the final wording, the page numbers reviewers expect, and a stable DOI. Paste the journal DOI into /doi-to-bibtex/ for that path. Cite the arXiv version when there’s no published counterpart yet, when versions diverge in a way that matters for your argument, or when the field convention is to cite preprints (parts of ML and theoretical CS). If you have a stack of references inside a Word manuscript and want them out as BibTeX, the references-conversion guide covers the workflow.

The community hasn’t fully settled this. @misc with eprint + archivePrefix + primaryClass is the most common convention and what the tool defaults to. Biblatex users often prefer @online with eprinttype = {arXiv} for cleaner rendering under modern styles. Some groups use @article with journal = {arXiv preprint} – readable but not strictly accurate. Use the per-row override to switch on a per-entry basis.

Yes. Both old-style (pre-April 2007, with a subject prefix and a 7-digit number) and new-style (YYMM.NNNNN) IDs are accepted, with or without a v version suffix. The proxy regex covers both forms. If you have a URL like https://arxiv.org/abs/1706.03762, paste just the ID portion.

Free, no signup. The arXiv path is the one resolver that goes through our infrastructure – arXiv’s API doesn’t allow browser-direct fetches because it doesn’t send CORS headers. The cPanel proxy caches a record for 30 days when you specify a version (since versioned IDs are immutable) and 7 days otherwise, keyed by the ID hash. It rate-limits per IP. The bibliographic content itself is never logged or persisted. The output exists in your browser session and disappears when you close the tab.

The clean BibTeX output is the right starting point, but venue-specific templates each have quirks – IEEE’s IEEEtran expects specific field orderings, NeurIPS has its own bibstyle, ACL uses acl_natbib. The tool doesn’t enforce venue conformance. For arXiv submissions themselves (compile-on-arXiv requirements, embedded fonts, source-tree quirks) see our arXiv LaTeX formatting service; for IEEE, ACM, Elsevier, and Springer see IEEE, ACM, Elsevier, and Springer.

You Have a Word or PDF File. We'll Convert it to Submission-Ready LaTeX.

We convert Word documents and PDFs into clean, submission-ready LaTeX for IEEE, Springer, Elsevier, ACM, arXiv, and more. Equations, tables, citations, and journal templates included. Compiled and tested in Overleaf before delivery.

Reviewed by a real LaTeX specialist.