Kurator provides scientific workflow tools for data quality improvement of natural history collections and other biodiversity data. Kurator Web is a set of a user friendly web interface to configure and launch curation workflows while maintaining provenance. Kurator-Akka and the Kurator YesWorkflow data curation software and code are available on GitHub.

For more information about Kurator, please visit our wiki or watch an introductory video (https://vimeo.com/274087877)

File Aggregator:

Aggregates two files into one file.

Controlled Field Assessor:

Creates a report of counts of distinct geographic values and provides recommended values.

Darwinizer:

Creates a new file with as many field names standardized to Darwin Core as possible.

Date Validator:

Validates event date fields and fills in missing dates from atomic event date fields.

Field Value Counter:

Creates a report of counts of distinct values and recommended values for values that are not standard.

Geography Cleaner:

Creates a new occurrences file with standardized geography and original geography saved in new fields.

Georeference Validator:

Performs validation of the georeference fields and fills in or transposes missing or inconsistent coordinates.

Property Parser:

Parses "dynamicProperties" field in a DarwinCore-Archive spreadsheet and creates separate fields for each value.

Geography Assessor:

Creates file containing the recommendations to standardize distinct combinations of higher geography.

Vocabulary Maker:

Creates a vocabulary file with fields for the original values, the standard values, and vetted values.


The Kurator project is a collaborative project led by The University of Illinois Urbana Champaign and Harvard University and other partners and funded by the National Science Foundation. Learn more about the project here (link to: http://wiki.datakurator.org/wiki. Kurator: A Provenance-enabled Workflow Platform and Toolkit to Curate Biodiversity Data Collaborative NSF Awards 1356438 and 1356751