Presentations

International conference
July 26, 2023

The dwctaxon R package for editing and validating taxonomic data

2023 Botanical Society of America Conference
  • Joel H. Nitta
  • Wataru Iwasaki

Event date
July 22, 2023 - July 26, 2023
Language
English
Presentation type
Oral presentation (general)
Venue
Boise, ID (online)
Country/Region
United States

Darwin Core (DwC) is a data standard that has been widely adopted across biodiversity databases because it enables the transfer of biological data in a unified format. The DwC standard for taxonomic data is especially important because taxonomic names are often used as unique identifiers to join data from disparate sources. Here, we introduce dwctaxon, a new R package for editing and validating taxonomic data in compliance with DwC. dwctaxon automates typical taxonomic database management tasks, such as transferring synonyms and filling columns with ID numbers, making workflows both more efficient and less error-prone. It also validates data against problems typically seen in taxonomic data, with checks for nine major error categories. It has been designed to be maximally compatible with DwC while allowing for flexibility in database design. dwctaxon has passed code review at rOpenSci (https://ropensci.org/) and is freely available from https://github.com/ropensci/dwctaxon and the Comprehensive R Archive Network (CRAN).
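
To make the kind of validation described above concrete, here is a minimal sketch in Python. It is not dwctaxon's API (the package itself is written in R); the function name `validate_taxa`, the three checks chosen, and the sample rows with placeholder names are all assumptions made for illustration. Only the column names (`taxonID`, `scientificName`, `taxonomicStatus`, `acceptedNameUsageID`) are standard Darwin Core Taxon terms.

```python
from collections import Counter


def validate_taxa(rows):
    """Return a list of problems found in Darwin Core style taxon rows.

    Hypothetical helper for illustration only; not part of dwctaxon.
    """
    problems = []
    ids = [row.get("taxonID") for row in rows]

    # Check 1: every row needs a taxonID.
    for i, taxon_id in enumerate(ids):
        if not taxon_id:
            problems.append(f"row {i}: missing taxonID")

    # Check 2: taxonID values must be unique.
    for taxon_id, count in Counter(t for t in ids if t).items():
        if count > 1:
            problems.append(f"taxonID {taxon_id}: duplicated {count} times")

    # Check 3: synonyms must point at a taxonID that exists in the data.
    known_ids = {t for t in ids if t}
    for i, row in enumerate(rows):
        accepted_id = row.get("acceptedNameUsageID")
        if accepted_id and accepted_id not in known_ids:
            problems.append(
                f"row {i}: acceptedNameUsageID {accepted_id} not found in taxonID"
            )
        if row.get("taxonomicStatus") == "synonym" and not accepted_id:
            problems.append(f"row {i}: synonym lacks acceptedNameUsageID")

    return problems


# Made-up sample data; the scientific names are placeholders, not real taxa.
taxa = [
    {"taxonID": "1", "scientificName": "Genus alpha",
     "taxonomicStatus": "accepted", "acceptedNameUsageID": None},
    {"taxonID": "2", "scientificName": "Genus beta",
     "taxonomicStatus": "synonym", "acceptedNameUsageID": "1"},
    {"taxonID": "3", "scientificName": "Genus gamma",
     "taxonomicStatus": "synonym", "acceptedNameUsageID": "9"},
]

for problem in validate_taxa(taxa):
    print(problem)
```

The third sample row is flagged because its `acceptedNameUsageID` refers to a `taxonID` that does not exist in the data, which is the sort of broken synonym mapping that becomes a problem when names are used as join keys across sources.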

Links
URL
https://github.com/ropensci/dwctaxon