Skip to content

Benchmaking pmc

Header metadata

Evaluation on 1943 random PDF files out of 1941 PDF (ratio 1.0).

Strict Matching (exact matches)

Field-level results

label precision recall f1 support
authors 93.03 92.84 92.93 1941
first_author 96.64 96.45 96.54 1941
title 84.44 84.05 84.24 1943
all fields (micro avg.) 91.37 91.11 91.24 5825
all fields (macro avg.) 91.37 91.11 91.24 5825

Soft Matching (ignoring punctuation, case and space characters mismatches)

Field-level results

label precision recall f1 support
authors 94.99 94.8 94.89 1941
first_author 97.06 96.86 96.96 1941
title 92.14 91.71 91.93 1943
all fields (micro avg.) 94.73 94.45 94.59 5825
all fields (macro avg.) 94.73 94.46 94.59 5825

Levenshtein Matching (Minimum Levenshtein distance at 0.8)

Field-level results

label precision recall f1 support
authors 96.8 96.6 96.7 1941
first_author 97.32 97.11 97.22 1941
title 98.29 97.84 98.07 1943
all fields (micro avg.) 97.47 97.18 97.33 5825
all fields (macro avg.) 97.47 97.18 97.33 5825

Ratcliff/Obershelp Matching (Minimum Ratcliff/Obershelp similarity at 0.95)

Field-level results

label precision recall f1 support
authors 95.92 95.72 95.82 1941
first_author 96.64 96.45 96.54 1941
title 96.28 95.83 96.05 1943
all fields (micro avg.) 96.28 96 96.14 5825
all fields (macro avg.) 96.28 96 96.14 5825

Instance-level results

Total expected instances:   1943
Total correct instances:    1531 (strict) 
Total correct instances:    1699 (soft) 
Total correct instances:    1840 (Levenshtein) 
Total correct instances:    1786 (ObservedRatcliffObershelp) 

Instance-level recall:  78.8    (strict) 
Instance-level recall:  87.44   (soft) 
Instance-level recall:  94.7    (Levenshtein) 
Instance-level recall:  91.92   (RatcliffObershelp) 

Evaluation metrics produced in 12.202 seconds