PUID - a universally unique identifier

INSTCODE - FAO institute code

NICODE - Code identifying the National Inventory; the code of the country preparing the National Inventory.

ACCENUMB - accession number, unique within bank

COLLNUMB - id assigned by collector

Either:
COLLCODE+ - FAO code of collecting institute

COLLNAME+ - collecting institute name, only used if no FAO number
COLLINSTADDRESS+ - address of collecting institute, only used if no FAO code

COLLMISSID - collecting mission id, id the group uses to identify this one mission it was collected on

GENUS - genus name for taxon, initial uppercase letter

SPECIES - Specific epithet portion of scientific name

SPAUTHOR - species authority

SUBTAXA - additional taxonomic info

SUBTAUTHOR - sub taxon authority

CROPNAME - common crop name e.g. malting barley

ACCENAME+ - any name given to accession

ACQATE - date of acquisition

ORIGCTY - country of origin, ISO 3166-1 code

One of:
LONGITUTE - longitude of collecting site
DECLONGITUDE - decimal longitude of collecting site

One of:
LATITUDE - latitude of collecting site
DECLATITUDE - decimal latitude of collecting site

COORDUNCERT - coordinate uncertainty in m

COORDDATUM - The geodetic datum or spatial reference system upon which the coordinates given in decimal latitude
and decimal longitude are based (e.g. WGS84, ETRS89, NAD83). The GPS uses the WGS84 datum.

GEOREFMETH - georeferencing method e.g. gps

ELEVATION - elevation of collecting site

COLLDATE - collection date of sample

One of:
BREDCODE+ - breeding institute fao code, can be same as INSTCODE
BREDNAME+ - breeding inst name, use if no FAO code

SAMPSTAT - biological status of accession, a number:

  1. Wild
  2. Natural
  3. Semi-natural/wild
  4. Semi-natural/sown
  5. Weedy
  6. Traditional cultivar/landrace
  7. Breeding/research material
  8. Breeder's line
  9. Synthetic population
  10. Hybrid
  11. Founder stock/base population
  12. Inbred line (parent of hybrid cultivar)
  13. Segregating population
  14. Clonal selection
  15. Genetic stock
  16. Mutant (e.g. induced/insertion mutants, tilling populations)
  17. Cytogenetic stocks (e.g. chromosome addition/substitution, aneuploids, amphiploids)
  18. Other genetic stocks (e.g. mapping populations)
  19. Advanced or improved cultivar (conventional breeding methods)
  20. GMO (by genetic engineering)
  21. Other (Elaborate in REMARKS field)

Fields allowing duplicates will have a + after their name, all these seem to be split with semicolons and no spaces

ANCEST - string of ancestral data e.g. mutation found in hanna

COLLSRC - Where this was collected from, a number:
10 Wild habitat
11 Forest or woodland
12 Shrubland
13 Grassland
14 Desert or tundra
15 Aquatic habitat
20 Farm or cultivated habitat
21 Field
22 Orchard
23 Backyard, kitchen or home garden (urban, peri-urban or rural)
24 Fallow land
25 Pasture
26 Farm store
27 Threshing floor
28 Park
30 Market or shop
40 Institute, Experimental station, Research organization, Genebank
50 Seed company
60 Weedy, disturbed or ruderal habitat
61 Roadside
62 Field margin
99 Other (Elaborate in REMARKS field)

STORAGE+ - how we store the germplasm, a number:

10 Seed collection
11 Short term
12 Medium term
13 Long term
20 field collection
30 In vitro collection
40 Cryopreserved collection
50 DNA collection
99 Other (elaborate in REMARKS field)

MLSSTAT - whether its available under the MLS international treaty (a treaty allowing scientists to share genetic data on common crops)

REMARKS - a space to add extra info on areas where

ACCEURL - URL linking to additional data about the accession

AEGISSTAT - whether the accession is part of AEGIS (a big european collection of accessions)

HISTORIC - whether its actively maintained or not

OTHERNUMB+ - other identifiers used

DUPLSITE+ - FAO code for site holding backup

DUPLSTNAME+ - name of institute where duplicate stored

One of:
DONORCODE - donor inst code FAO
DONORNAME - name of inst if no FAO code

DONORNUMB - id given to accession by donor, follows ACCENUMB standard

Eurisco data schema

The format data will be sent to the project to be put into GRIN-Global. Each box represents one column/field within the table.

taxonomy species relation

Species name

Species authority

taxonomy family relation

Family name

Family authority

taxonomy genus relation

Genus name

Genus authority

taxonomy common name relation

Genus Id + Species Id

Common name + simplified name

Origins of name + Citation for name

accessions relation

Taxonomy in relation to taxonomy table, specifically the species which then cascades up

Is backed up

Name of backup site 2 in relation to sites table

FAO number of backup site 2 in relation to sites table

Shortened name of backup site 2 in relation to sites table

Accession id in 3 parts

Data in Grin Global

The different locations data will be added into GRIN-Global, the database is built of 40+ different tables so I've just put the useful ones here. Yellow boxes are the names of each table and the boxes below are the colums/fields.

Ive added a selection of fields from each table, a number of which arnt used but they're there to give an idea of the tables contents.

Also note the list starting with a blue box, this is a list of new fields we will have to add.

Arrows connect EURISCO fields to our different tables. Pink arrows are links I'm unsure of.

longitude

uncertainty

formatted locality

georeference datum

GG's lat and longitude are in one form, eurisco offers 2

georeference protocol

georeference annotation

environment description

Notes

Inventory mainenence policy relation

Name

How its stored, e.g. cutting

How we measure it e.g. grams, number of cuttings

Web visible comment on maintenence

Distribution form

How much we distribute by defauly when requested

Measurement we distribute in

Regeneration method

Name of backup site 1 in relation to sites table

FAO number of backup site 1 in relation to sites table

Shortened name of backup site 1 in relation to sites table

lifeform type, e.g. shrub, biennial

Whether it should be visible on the web

Whether its active or inactive

form initially recieved in e.g. cutting or seed

When it was recieved

Level of improvement e.g. wild, cultivated

source type, collected, donate, developed

source date format

quantity collected

unit of quantity collected

source date

form collected in

number of plants sampled

elevation m

latitude

accession source relation

Country

type of location e.g. forest

site relation

Name of site

FAO Acronym

FAO institute number

Add an "additional taxonomic info" field to species taxonomy maybe? or make a new table for additional info

Add collmissid field to accessions source table

Adding new collector and donor fields to accessions source table. These would just point to collaborator entries.

Add collmissid field to accessions source table

Add new breeder field to source, links to collaborator entry

Add ancestry field to accession relation

Add MLS status boolean to accession(?) table

Add AEGIS status boolean to accession(?) table

Add URL field to accession table

Add accession name field to accession table

Add an Other Numbers field to accession table

Add donornumber field to accession table

Add EURISCO PID field to accession table

Add National Inventory code field to accession relation

New fields that will need adding

For this to work, STORAGE, COLLSRC and SAMPSTAT will need to be new custom fields and then we will have 3 REMARK fields, one for each one...