skills/43-wentorai-research-plugins/skills/domains/ecology/gbif-api/SKILL.md
Global biodiversity data API for species occurrences and datasets
npx skillsauth add brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research gbif-apiInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
The Global Biodiversity Information Facility (GBIF) is an international network and data infrastructure funded by governments worldwide, aimed at providing open access to biodiversity data. GBIF aggregates hundreds of millions of species occurrence records from natural history collections, citizen science platforms, monitoring networks, and published literature across the globe.
The GBIF API provides programmatic access to this vast repository of biodiversity data. Researchers can search for species occurrences by taxonomy, geography, time period, and dataset. The API also supports taxonomic name matching, dataset discovery, and species profile lookups. It serves as a foundational resource for ecological research, conservation planning, biogeography, and environmental impact assessments.
Ecologists, conservation biologists, biogeographers, and environmental scientists rely on the GBIF API to retrieve georeferenced occurrence data for species distribution modeling, climate change impact analysis, invasive species tracking, and biodiversity hotspot identification. The data is freely available under open data licenses.
No authentication required for read access. The GBIF API is publicly accessible without any API key or token. All search and retrieval endpoints are open. User authentication is only required for data publishing operations (creating datasets and uploading occurrences), which requires a GBIF account.
Search for georeferenced biodiversity observation and specimen records across all GBIF-indexed datasets.
GET https://api.gbif.org/v1/occurrence/search| Parameter | Type | Required | Description |
|-----------------|--------|----------|------------------------------------------------------|
| q | string | No | Full-text search query |
| taxonKey | int | No | GBIF backbone taxonomy key |
| scientificName | string | No | Scientific name to filter by |
| country | string | No | ISO 3166-1 alpha-2 country code |
| hasCoordinate | bool | No | Filter for georeferenced records only |
| year | string | No | Year or range (e.g., 2020,2024) |
| limit | int | No | Number of results (default 20, max 300) |
| offset | int | No | Pagination offset |
curl "https://api.gbif.org/v1/occurrence/search?scientificName=Panthera+tigris&hasCoordinate=true&limit=10"
count (total matches), results array with key, scientificName, decimalLatitude, decimalLongitude, country, basisOfRecord, eventDate, datasetKey, publishingOrgKey, and media links.Match a species name against the GBIF backbone taxonomy to resolve canonical names and get taxonomy keys.
GET https://api.gbif.org/v1/species/match| Parameter | Type | Required | Description | |-----------|--------|----------|------------------------------------------| | name | string | Yes | Scientific name to match | | kingdom | string | No | Kingdom filter for disambiguation | | strict | bool | No | If true, only return exact matches |
curl "https://api.gbif.org/v1/species/match?name=Homo+sapiens"
usageKey, scientificName, canonicalName, rank, status, kingdom, phylum, class, order, family, genus, species, confidence score, and matchType.Search for and retrieve metadata about GBIF-indexed datasets from publishers worldwide.
GET https://api.gbif.org/v1/dataset| Parameter | Type | Required | Description |
|-----------------|--------|----------|-------------------------------------------------|
| q | string | No | Full-text search query |
| type | string | No | Dataset type: OCCURRENCE, CHECKLIST, etc. |
| publishingOrg | string | No | Publishing organization UUID |
| limit | int | No | Number of results (default 20, max 1000) |
| offset | int | No | Pagination offset |
curl "https://api.gbif.org/v1/dataset?q=bird+monitoring&type=OCCURRENCE&limit=5"
count, results array with key, title, description, type, publishingOrganizationKey, license, recordCount, and endpoints.No formal rate limits are enforced on the GBIF API. However, GBIF recommends responsible use patterns. Large data downloads (millions of records) should use the asynchronous download API at https://api.gbif.org/v1/occurrence/download/request rather than paginating through the search endpoint. The search endpoint is limited to 100,000 records maximum per query via pagination.
Retrieve georeferenced occurrence data for species distribution modeling:
import requests
params = {
"taxonKey": 2480498, # Panthera tigris
"hasCoordinate": True,
"limit": 300
}
resp = requests.get("https://api.gbif.org/v1/occurrence/search", params=params)
data = resp.json()
coordinates = [(r["decimalLongitude"], r["decimalLatitude"])
for r in data["results"]
if "decimalLongitude" in r and "decimalLatitude" in r]
print(f"Retrieved {len(coordinates)} georeferenced occurrences of {data['results'][0]['scientificName']}")
Resolve a list of species names against the GBIF backbone taxonomy:
import requests
names = ["Homo sapiens", "Canis lupus", "Quercus robur", "Drosophila melanogaster"]
for name in names:
resp = requests.get("https://api.gbif.org/v1/species/match", params={"name": name})
match = resp.json()
print(f"{name} -> {match['canonicalName']} (key: {match['usageKey']}, confidence: {match['confidence']})")
For large-scale analyses requiring millions of records, use the asynchronous download API:
curl -X POST "https://api.gbif.org/v1/occurrence/download/request" \
-H "Content-Type: application/json" \
-u username:password \
-d '{"creator":"username","predicate":{"type":"equals","key":"TAXON_KEY","value":"2480498"}}'
development
Conduct rigorous thematic analysis (TA) of qualitative data following Braun and Clarke's (2006) six-phase framework. Use whenever the user mentions 'thematic analysis', 'TA', 'Braun and Clarke', 'qualitative coding', 'identifying themes', or asks for help analysing interviews, focus groups, open-ended survey responses, or transcripts to identify patterns. Also trigger for questions about inductive vs theoretical coding, semantic vs latent themes, essentialist vs constructionist epistemology, building a thematic map, or writing up a qualitative findings section. Covers all six phases, the four upfront analytic decisions, the 15-point quality checklist, and the five common pitfalls. Produces a Word document write-up and an annotated thematic map. Does NOT cover IPA, grounded theory, discourse analysis, conversation analysis, or narrative analysis — use a different method for those.
development
Guide users through writing a systematic literature review (SLR) following the PRISMA 2020 framework. Use this skill whenever the user mentions 'systematic review', 'systematic literature review', 'SLR', 'PRISMA', 'PRISMA 2020', 'PRISMA flow diagram', 'PRISMA checklist', or asks for help writing, structuring, or auditing a literature review that follows reporting guidelines. Also trigger when the user asks about inclusion/exclusion criteria for a review, search strategies for databases like Scopus/WoS/PubMed, study selection processes, risk of bias assessment, or narrative synthesis for a review paper. This skill covers the full PRISMA 2020 checklist (27 items), produces a Word document manuscript in strict journal article format, generates an annotated PRISMA flow diagram, and enforces APA 7th Edition referencing throughout. It does NOT cover meta-analysis or statistical pooling. By Chuah Kee Man.
testing
Performs placebo-in-time sensitivity analysis with hierarchical null model and optional Bayesian assurance. Use when checking model robustness, verifying lack of pre-intervention effects, or estimating study power.
data-ai
Fit, summarize, plot, and interpret a chosen CausalPy experiment. Use after the causal method has been selected, including when configuring PyMC/sklearn models and scale-aware custom priors.