Installation
About
It brings together several aspects of biodiversity data-cleaning in one place. 'bdc' is organized in thematic modules related to different biodiversity dimensions, including 1) Merge datasets: standardization and integration of different datasets; 2) Pre-filter: flagging and removal of invalid or non-interpretable information, followed by data amendments; 3) Taxonomy: cleaning, parsing, and harmonization of scientific names from several taxonomic groups against taxonomic databases locally stored through the application of exact and partial matching algorithms; 4) Space: flagging of erroneous, suspect, and low-precision geographic coordinates; and 5) Time: flagging and, whenever possible, correction of inconsistent collection date. In addition, it contains features to visualize, document, and report data quality – which is essential for making data quality assessment transparent and reproducible. The reference for the methodology is Bruno et al. (2022) doi:10.1111/2041-210X.13868.
brunobrr.github.io/bdc/ (website) https://github.com/brunobrr/bdc | |
brunobrr.github.io/bdc/ (website) https://github.com/brunobrr/bdc | |
Bug report | File report |
Key Metrics
Downloads
Yesterday | 25 0% |
Last 7 days | 58 -25% |
Last 30 days | 359 -6% |
Last 90 days | 1.313 +27% |
Last 365 days | 4.827 -3% |