skills/materials-databases/SKILL.md
Expert assistant for accessing materials databases (AFLOW and Materials Project) - query crystal structures, materials properties, thermodynamic data, and computational results from comprehensive databases
npx skillsauth add Hongyu-yu/matsci-ai-skills materials-databasesInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
You are an expert assistant for accessing and querying materials science databases, specifically AFLOW and Materials Project. Help users retrieve crystal structures, materials properties, and computational data efficiently.
This skill enables access to two major materials databases:
AFLOW (Automatic Flow for Materials Discovery)
Materials Project (MP)
# Install the Materials Project API client
pip install mp-api
# Alternative: with conda
conda install -c conda-forge mp-api
AFLOW uses REST API - no Python package installation required. However, for convenience:
# Optional: Install requests for API calls
pip install requests
# Optional: Install aflow Python package (community-maintained)
pip install aflow
# For structure manipulation and visualization
pip install pymatgen ase
# For data analysis
pip install pandas numpy matplotlib
Get an API key:
Set up authentication:
Option A: Environment variable (recommended)
export MP_API_KEY="your_api_key_here"
Option B: Configuration file
# Create ~/.config/.mpapi.json or ~/.pmgrc.yaml
echo '{"MAPI_KEY": "your_api_key_here"}' > ~/.config/.mpapi.json
Option C: Pass directly in code
from mp_api.client import MPRester
with MPRester("your_api_key_here") as mpr:
# Your code here
pass
No API key required - AFLOW API is publicly accessible.
Search by formula:
from mp_api.client import MPRester
with MPRester(api_key="YOUR_API_KEY") as mpr:
# Search for all Silicon entries
docs = mpr.materials.summary.search(formula="Si")
# Get specific properties
docs = mpr.materials.summary.search(
formula="Fe2O3",
fields=["material_id", "formula_pretty", "band_gap", "energy_per_atom"]
)
Search by material ID:
with MPRester() as mpr: # Uses env var or config file
structure = mpr.get_structure_by_material_id("mp-149")
doc = mpr.materials.summary.get_data_by_id("mp-149")
Search by criteria:
with MPRester() as mpr:
# Find materials with band gap between 1-3 eV
docs = mpr.materials.summary.search(
band_gap=(1, 3),
elements=["O", "Ti"],
num_elements=2
)
# Find stable materials
docs = mpr.materials.summary.search(
energy_above_hull=(0, 0.01), # Nearly stable
fields=["material_id", "formula_pretty", "energy_above_hull"]
)
Available data types:
materials.summary - General materials propertiesmaterials.thermo - Thermodynamic datamaterials.electronic_structure - Band structures, DOSmaterials.phonon - Phonon band structuresmaterials.elasticity - Elastic tensorsmaterials.surface_properties - Surface energiesmolecules - Molecular structures and propertiesBasic URL structure:
http://aflowlib.duke.edu/AFLOWDATA/ICSD_WEB/<system>/<file>?<directives>
Common queries using REST:
import requests
# Get all keywords for a material
url = "http://aflowlib.duke.edu/AFLOWDATA/ICSD_WEB/FCC/Ag1/?keywords"
response = requests.get(url)
data = response.text
# Get specific property (e.g., enthalpy)
url = "http://aflowlib.duke.edu/AFLOWDATA/ICSD_WEB/FCC/Ag1/?enthalpy_formation_atom"
response = requests.get(url)
# Search using AFLUX (AFLOW search language)
url = "http://aflowlib.duke.edu/search/API/?species(Au),Egap(5*),catalog(ICSD)"
response = requests.get(url)
Using aflow Python package (optional):
import aflow
# Search for materials
results = aflow.search(filter='species(Au),Egap(5*)')
for result in results:
print(result.enthalpy_formation_atom)
print(result.Egap)
structure = result.atoms # Get ASE Atoms object
Common AFLOW directives:
?keywords - List all available properties?enthalpy_formation_atom - Formation enthalpy per atom?Egap - Band gap?volume_cell - Unit cell volume?geometry - Crystal structure (POSCAR format)?files - List all available filesAFLUX search syntax:
species(element1,element2) # Chemical system
Egap(min*,max*) # Band gap range (eV)
enthalpy_formation_atom(min*,max*) # Formation enthalpy range
catalog(ICSD) # Database catalog
# Example: Cross-reference findings
from mp_api.client import MPRester
import requests
# Find in Materials Project
with MPRester() as mpr:
mp_docs = mpr.materials.summary.search(
formula="TiO2",
fields=["material_id", "band_gap", "energy_per_atom"]
)
# Cross-check with AFLOW
aflow_url = "http://aflowlib.duke.edu/search/API/?species(Ti,O),nspecies(2)"
aflow_data = requests.get(aflow_url).json()
Use specific queries - Request only needed fields to reduce data transfer
docs = mpr.materials.summary.search(
formula="Li",
fields=["material_id", "formula_pretty", "band_gap"] # Specify fields
)
Paginate large results - Use chunk_size for large queries
docs = mpr.materials.summary.search(
elements=["O"],
num_chunks=10, # Fetch in chunks
chunk_size=1000
)
Cache results locally - Save API results to avoid repeated queries
import pickle
# Save results
with open("mp_results.pkl", "wb") as f:
pickle.dump(docs, f)
Handle structures properly - Convert between formats as needed
from pymatgen.io.ase import AseAtomsAdaptor
# MP Structure → ASE Atoms
atoms = AseAtomsAdaptor.get_atoms(structure)
# ASE Atoms → MP Structure
structure = AseAtomsAdaptor.get_structure(atoms)
Check API status - Materials Project has rate limits (typically generous)
from mp_api.client import MPRester
from mp_api.client.core import MPRestError
try:
with MPRester() as mpr:
docs = mpr.materials.summary.search(formula="InvalidFormula")
except MPRestError as e:
print(f"API Error: {e}")
except Exception as e:
print(f"Unexpected error: {e}")
# Materials Project
with MPRester() as mpr:
docs = mpr.materials.summary.search(
formula="GaN",
fields=["material_id", "band_gap", "is_gap_direct"]
)
for doc in docs:
print(f"{doc.material_id}: {doc.band_gap} eV (direct: {doc.is_gap_direct})")
# AFLOW
import requests
url = "http://aflowlib.duke.edu/search/API/?species(Ga,N),Egap"
response = requests.get(url)
# Materials Project - returns pymatgen Structure
with MPRester() as mpr:
structure = mpr.get_structure_by_material_id("mp-149")
structure.to(filename="POSCAR")
# AFLOW - returns POSCAR format text
url = "http://aflowlib.duke.edu/AFLOWDATA/ICSD_WEB/FCC/Ag1/?geometry"
poscar = requests.get(url).text
# Find thermodynamically stable oxides with specific properties
with MPRester() as mpr:
docs = mpr.materials.summary.search(
elements=["O"],
energy_above_hull=(0, 0.05), # Stable or nearly stable
band_gap=(1.0, 4.0), # Semiconducting
num_elements=(2, 3), # Binary or ternary
fields=["material_id", "formula_pretty", "band_gap", "energy_above_hull"]
)
references/materials-project.md for detailed MP API documentationreferences/aflow.md for AFLOW REST API and AFLUX syntaxexamples/ for complete working scriptstools
Spreadsheet toolkit (.xlsx/.csv). Create/edit with formulas/formatting, analyze data, visualization, recalculate formulas, for spreadsheet processing and analysis.
tools
Expert assistant for VASP (Vienna Ab initio Simulation Package) calculations - input file generation, parameter selection, workflow setup, and best practices for accurate DFT calculations
data-ai
This skill should be used when working with pre-trained transformer models for natural language processing, computer vision, audio, or multimodal tasks. Use for text generation, classification, question answering, translation, summarization, image classification, object detection, speech recognition, and fine-tuning models on custom datasets.
tools
Graph Neural Networks (PyG). Node/graph classification, link prediction, GCN, GAT, GraphSAGE, heterogeneous graphs, molecular property prediction, for geometric deep learning.