I’ve heard many people complain about this, but why do researchers insist on sharing their data in a PDF document? Especially a 600+ page PDF table in which the columns select before the rows (sometimes).
Tiers of data sharing:
Platinum: SQL dump, SQLite database
Gold: CSV or tab-delimited text
Silver: MS Excel (beware of mangled gene names like ‘1-Sep’ for SEPT1)
Bronze: MS Word