set_scientific_name#
One of the functions you can use to check certain columns of your data is set_scientific_name()
.
This function aims to check that you have the following Darwin Core Vocabulary Terms:
scientificName
: the scientific name of the species you observed
It can also can check the following:
scientificNameRank
(OPTIONAL): rank of the scientific name you are providing.scientificNameAuthorship
(OPTIONAL): Authors of the species name you are using.
specifying scientificName
#
Like with other set_*
functions, to specify which column you want to rename or change, you
specify it with the Darwin Core term. In this case, it is scientificName
.
>>> my_dwca.set_scientific_name(dataframe=occ,scientificName='Species')
>>> my_dwca.occurrences.head()
scientificName Latitude Longitude Collection_date
0 Corymbia latifolia -13.04 131.07 29/3/2022
1 Eucalyptus tectifica -13.04 131.07 13/9/2022
2 Banksia aemula -33.60 150.72 15/8/2022
3 Eucalyptus sclerophylla -33.60 150.72 16/6/2022
4 Persoonia laurina -33.60 150.72 19/10/2022
what does check_data
and suggest_workflow
say now?#
Note: each of the set_*
functions checks your data for compliance with the
Darwin core standard, but it’s always good to double-check your data.
Now, we can check that our data column do comply with the Darwin Core standard.
>>> my_dwca.set_scientific_name(dataframe=occ,scientificName='Species')
>>> my_dwca.check_dataset()
Number of Errors Pass/Fail Column name
------------------ ----------- --------------
0 ✓ scientificName
══ Results ════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════════
Errors: 0 | Passes: 1
✗ Data does not meet minimum Darwin core requirements
Use corella.suggest_workflow()
However, since we don’t have all of the required columns, we can run suggest_workflow()
again to see how our data is doing this time round.
>>> my_dwca.suggest_workflow()
── Darwin Core terms ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
── All DwC terms ──
Matched 1 of 4 column names to DwC terms:
✓ Matched: scientificName
✗ Unmatched: Collection_date, Latitude, Longitude
── Minimum required DwC terms occurrences ──
Type Matched term(s) Missing term(s)
------------------------- ----------------- ------------------------------------------------
Identifier (at least one) - occurrenceID OR catalogNumber OR recordNumber
Record type - basisOfRecord
Scientific name scientificName -
Location - decimalLatitude, decimalLongitude, geodeticDatum
Date/Time - eventDate
── Suggested workflow ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
── Occurrences ──
To make your occurrences Darwin Core compliant, use the following workflow:
corella.set_occurrences()
corella.set_coordinates()
corella.set_datetime()
Additional functions: set_abundance(), set_collection(), set_individual_traits(), set_license(), set_locality(), set_taxonomy()
Other functions#
To learn more about how to use other functions, go to
Optional functions:
Creating Unique IDs:
Passing Dataset: