Citizen science is scientific research conducted, in whole or in part, by amateur (or non-professional) scientists. Biodiversity observations by citizen scientists have become significant in the last 10 years thanks to projects like:

Citizen science is scientific research conducted, in whole or in part, by amateur (or non-professional) scientists. Citizen science is sometimes described as “public participation in scientific research,” participatory monitoring, and participatory action research (wikipedia).

The rise of citizen science on the GBIF network

Citizen scientists have been contributing a large and growing percentage of records to the GBIF network.

50% of occurrence records on GBIF are citizen science observations

For the last few years, citizen science observations make up around 50% of the biodiversity knowledge on the GBIF network. The eBird dataset is by far the largest contributor and when this dataset is updated we can see a spike in the percentage graph. In between eBird updates, non-citizen science datasets decrease the percentage.

Largest datasets on GBIF are citizen science datasets

6 of the top 10 datasets on the GBIF network are citizen science datasets.

Birds are very popular in citizen science

617 of the top 800 species observed by citizen scientists are birds.

  • Top bird 1st place (seagull, 8M records)
  • Top insect 426th place (red admiral butterfly, 200k records)
  • Top mammal 540th place (row deer, 140k records)
  • Top plant 645th place (common nettle, 100k records)

Here I plot the top 800 species in citizen science in terms of occurrence count.

interactive version of graph above

There are > 8 000 000 seagull occurrence records

roe deer

Larus argentatus subsp. argenteus by G.Droege via Botanic Garden and Botanical Museum Berlin-Dahlem. Photo licensed under CC BY-SA 3.0

Top insect is the red admiral butterfly 200k records

roe deer

Vanessa atalanta by W.-H. Kusber via BoBO - Botanic Garden and Botanical Museum Berlin-Dahlem Observations. Photo licensed under CC BY-SA 4.0

Top mammal is the roe deer 140k records

<img src="/post/2018-12-10-gbif-citizen-science-data_files/CapreolusCapreolus.JPG" alt=“roe deer” width=50%">

Capreolus capreolus by Trine Brevig via the Norwegian Species Observation Service. Photo licensed under CC BY 4.0

Top plant is the common nettle 100k records

<img src="/post/2018-12-10-gbif-citizen-science-data_files/UrticaDioica.jpg" alt=“common nettle” width=50%">

Urtica dioica subsp. dioica by Peter de Lange via iNaturalist. No copyright.

Citizen science is less biodiverse than professional science

Here I plot the number of unique genera observed by citizen scientists against those recorded by non-citizen science datasets or likely collected by professional scientists.

So while citizen science is an important and growing part of biodiversity data, it cannot replace well-curated professional museum collections and other professional sources.

While citizen scientists have observed over 50% of GBIF records (500 Million!), they have only 20% of the genera observed by professional scientists (50k out of around 200k genera). So while citizen science will probably not replace professional science, it is still and important and growing part of biodiveristy monitoring and data collection. No doubt the biodiversity of citizen science observations will continue to increase as more specialists start using apps like iNaturalist. Perhaps citizen scientists will observe 60k or 70k genera by the end of 2019, and do it for free!

Technical details

  • Link to gitHub with data and R code.
  • Link to a previous blog post where we describe how we classified citizen science datasets.
  • As of the writing of this blog there are 412 datasets classified as citizen science on the GBIF network.

You too can get a list of the citizen science datasets using the gbif api. Paste the following link into a web browser and get a list of citizen science datasets via the GBIF api.

http://api.gbif.org/v1/dataset?machineTagNamespace=citizenScience.mgrosjean.gbif.org&limit=500

There is also a static csv table here.

Do you know a citizen science dataset that is not in the list? Leave a comment.