PUID - a universally unique identifier
INSTCODE - FAO institute code
NICODE - Code identifying the National Inventory; the code of the country preparing the National Inventory.
ACCENUMB - accession number, unique within bank
COLLNUMB - id assigned by collector
Either:
COLLCODE+ - FAO code of collecting institute
COLLNAME+ - collecting institute name, only used if no FAO number
COLLINSTADDRESS+ - address of collecting institute, only used if no FAO code
COLLMISSID - collecting mission id, id the group uses to identify this one mission it was collected on
GENUS - genus name for taxon, initial uppercase letter
SPECIES - Specific epithet portion of scientific name
SPAUTHOR - species authority
SUBTAXA - additional taxonomic info
SUBTAUTHOR - sub taxon authority
CROPNAME - common crop name e.g. malting barley
ACCENAME+ - any name given to accession
ACQATE - date of acquisition
ORIGCTY - country of origin, ISO 3166-1 code
One of:
LONGITUTE - longitude of collecting site
DECLONGITUDE - decimal longitude of collecting site
One of:
LATITUDE - latitude of collecting site
DECLATITUDE - decimal latitude of collecting site
COORDUNCERT - coordinate uncertainty in m
COORDDATUM - The geodetic datum or spatial reference system upon which the coordinates given in decimal latitude
and decimal longitude are based (e.g. WGS84, ETRS89, NAD83). The GPS uses the WGS84 datum.
GEOREFMETH - georeferencing method e.g. gps
ELEVATION - elevation of collecting site
COLLDATE - collection date of sample
One of:
BREDCODE+ - breeding institute fao code, can be same as INSTCODE
BREDNAME+ - breeding inst name, use if no FAO code
SAMPSTAT - biological status of accession, a number:
- Wild
- Natural
- Semi-natural/wild
- Semi-natural/sown
- Weedy
- Traditional cultivar/landrace
- Breeding/research material
- Breeder's line
- Synthetic population
- Hybrid
- Founder stock/base population
- Inbred line (parent of hybrid cultivar)
- Segregating population
- Clonal selection
- Genetic stock
- Mutant (e.g. induced/insertion mutants, tilling populations)
- Cytogenetic stocks (e.g. chromosome addition/substitution, aneuploids, amphiploids)
- Other genetic stocks (e.g. mapping populations)
- Advanced or improved cultivar (conventional breeding methods)
- GMO (by genetic engineering)
- Other (Elaborate in REMARKS field)
Fields allowing duplicates will have a + after their name, all these seem to be split with semicolons and no spaces
ANCEST - string of ancestral data e.g. mutation found in hanna
COLLSRC - Where this was collected from, a number:
10 Wild habitat
11 Forest or woodland
12 Shrubland
13 Grassland
14 Desert or tundra
15 Aquatic habitat
20 Farm or cultivated habitat
21 Field
22 Orchard
23 Backyard, kitchen or home garden (urban, peri-urban or rural)
24 Fallow land
25 Pasture
26 Farm store
27 Threshing floor
28 Park
30 Market or shop
40 Institute, Experimental station, Research organization, Genebank
50 Seed company
60 Weedy, disturbed or ruderal habitat
61 Roadside
62 Field margin
99 Other (Elaborate in REMARKS field)
STORAGE+ - how we store the germplasm, a number:
10 Seed collection
11 Short term
12 Medium term
13 Long term
20 field collection
30 In vitro collection
40 Cryopreserved collection
50 DNA collection
99 Other (elaborate in REMARKS field)
MLSSTAT - whether its available under the MLS international treaty (a treaty allowing scientists to share genetic data on common crops)
REMARKS - a space to add extra info on areas where
ACCEURL - URL linking to additional data about the accession
AEGISSTAT - whether the accession is part of AEGIS (a big european collection of accessions)
HISTORIC - whether its actively maintained or not
OTHERNUMB+ - other identifiers used
DUPLSITE+ - FAO code for site holding backup
DUPLSTNAME+ - name of institute where duplicate stored
One of:
DONORCODE - donor inst code FAO
DONORNAME - name of inst if no FAO code
DONORNUMB - id given to accession by donor, follows ACCENUMB standard
Eurisco data schema
The format data will be sent to the project to be put into GRIN-Global. Each box represents one column/field within the table.
taxonomy species relation
Species name
Species authority
taxonomy family relation
Family name
Family authority
taxonomy genus relation
Genus name
Genus authority
taxonomy common name relation
Genus Id + Species Id
Common name + simplified name
Origins of name + Citation for name
accessions relation
Taxonomy in relation to taxonomy table, specifically the species which then cascades up
Is backed up
Name of backup site 2 in relation to sites table
FAO number of backup site 2 in relation to sites table
Shortened name of backup site 2 in relation to sites table
Accession id in 3 parts
Data in Grin Global
The different locations data will be added into GRIN-Global, the database is built of 40+ different tables so I've just put the useful ones here. Yellow boxes are the names of each table and the boxes below are the colums/fields.
Ive added a selection of fields from each table, a number of which arnt used but they're there to give an idea of the tables contents.
Also note the list starting with a blue box, this is a list of new fields we will have to add.
Arrows connect EURISCO fields to our different tables. Pink arrows are links I'm unsure of.
longitude
uncertainty
formatted locality
georeference datum
GG's lat and longitude are in one form, eurisco offers 2
georeference protocol
georeference annotation
environment description
Notes
Inventory mainenence policy relation
Name
How its stored, e.g. cutting
How we measure it e.g. grams, number of cuttings
Web visible comment on maintenence
Distribution form
How much we distribute by defauly when requested
Measurement we distribute in
Regeneration method
Name of backup site 1 in relation to sites table
FAO number of backup site 1 in relation to sites table
Shortened name of backup site 1 in relation to sites table
lifeform type, e.g. shrub, biennial
Whether it should be visible on the web
Whether its active or inactive
form initially recieved in e.g. cutting or seed
When it was recieved
Level of improvement e.g. wild, cultivated
source type, collected, donate, developed
source date format
quantity collected
unit of quantity collected
source date
form collected in
number of plants sampled
elevation m
latitude
accession source relation
Country
type of location e.g. forest
site relation
Name of site
FAO Acronym
FAO institute number
Add an "additional taxonomic info" field to species taxonomy maybe? or make a new table for additional info
Add collmissid field to accessions source table
Adding new collector and donor fields to accessions source table. These would just point to collaborator entries.
Add collmissid field to accessions source table
Add new breeder field to source, links to collaborator entry
Add ancestry field to accession relation
Add MLS status boolean to accession(?) table
Add AEGIS status boolean to accession(?) table
Add URL field to accession table
Add accession name field to accession table
Add an Other Numbers field to accession table
Add donornumber field to accession table
Add EURISCO PID field to accession table
Add National Inventory code field to accession relation
New fields that will need adding
For this to work, STORAGE, COLLSRC and SAMPSTAT will need to be new custom fields and then we will have 3 REMARK fields, one for each one...