Project reference

If you want to cite this work, please simply refer to the github project:

GROBID (2008-2021) <>

Please do not include a particular person name to emphasize the project and the tool!

We also ask you not to cite any old research papers, but the current project itself, because, yes, we can cite a software project in the bibliographical references and not just mention it in a foot note ;)

Here's a BibTeX entry using the Software Heritage project-level permanent identifier:

    title = {GROBID},
    howpublished = {\url{}},
    publisher = {GitHub},
    year = {2008--2021},
    archivePrefix = {swh},
    eprint = {1:dir:dab86b296e3c3216e2241968f0d63b68e8209d3c}

Presentations on Grobid

GROBID in 30 slides (2015).

GROBID in 20 slides (2012).

P. Lopez. Automatic Extraction and Resolution of Bibliographical References in Patent Documents. First Information Retrieval Facility Conference (IRFC), Vienna, May 2010. LNCS 6107, pp. 120-135. Springer, Heidelberg, 2010.

Evaluation and usages

The following articles are provided for information - it does not mean that we agree with all their statements about Grobid (please refer to the present documentation for the actual features and capacities of the tool) or with all the various methodologies used for evaluation, but they all explore interesting aspects with Grobid.

Articles on CRF for bibliographical reference parsing

Other similar Open Source tools

CiteSeerX page on Scholarly Information Extraction which lists tools and related information (ok now outdated).