skills/dpdata-driver/SKILL.md
Use dpdata Python Driver plugins to label systems (energies/forces/virials) via System.predict(), list available drivers, and build Driver objects (ase/deepmd/gaussian/sqm/hybrid). Use when working with dpdata Python API (not CLI) and you need driver-based energy/force prediction, plugin registration keys, or examples of using dpdata with ASE calculators or DeePMD models.
npx skillsauth add deepmodeling/dpdata dpdata-driverInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Use dpdata “driver plugins” to label a dpdata.System (predict energies/forces/virials) and obtain a dpdata.LabeledSystem.
System into a LabeledSystem by computing:
energies (required)forces (optional but common)virials (optional)In dpdata, this is exposed as:
System.predict(*args, driver="dp", **kwargs) -> LabeledSystemdriver can be:
"ase", "dp", "gaussian"Driver.get_driver("ase")(...)When unsure what drivers exist in this dpdata version/env, query them at runtime:
import dpdata
from dpdata.driver import Driver
print(sorted(Driver.get_drivers().keys()))
import dpdata ensures built-in plugins are loaded before listing registered drivers.
In the current repo state, keys include:
asedp / deepmd / deepmd-kitgaussiansqmhybrid(Exact set depends on dpdata version and installed extras.)
import dpdata
from dpdata.system import System
sys = System("input.xyz", fmt="xyz")
ls = sys.predict(driver="ase", calculator=...) # returns dpdata.LabeledSystem
assert "energies" in ls.data
# optional:
# assert "forces" in ls.data
# assert "virials" in ls.data
This is the easiest fully runnable example because it doesn’t require external QM software.
Dependencies (recommended): declare script dependencies with uv inline metadata, then run with uv run.
# /// script
# requires-python = ">=3.8"
# dependencies = [
# "dpdata",
# "numpy",
# "ase",
# ]
# ///
Script:
from pathlib import Path
import numpy as np
from ase.calculators.lj import LennardJones
from dpdata.system import System
# write a tiny molecule
Path("tmp.xyz").write_text("""2\n\nH 0 0 0\nH 0 0 0.74\n""")
sys = System("tmp.xyz", fmt="xyz")
ls = sys.predict(driver="ase", calculator=LennardJones())
print("energies", np.array(ls.data["energies"]))
print("forces shape", np.array(ls.data["forces"]).shape)
if "virials" in ls.data:
print("virials shape", np.array(ls.data["virials"]).shape)
else:
print("virials: <not provided by this driver/calculator>")
from ase.calculators.lj import LennardJones
from dpdata.driver import Driver
from dpdata.system import System
sys = System("tmp.xyz", fmt="xyz")
ase_driver = Driver.get_driver("ase")(calculator=LennardJones())
ls = sys.predict(driver=ase_driver)
Use driver="hybrid" to sum energies/forces/virials from multiple drivers.
The HybridDriver accepts drivers=[ ... ] where each item is either:
Driver instance{"type": "sqm", ...} (type is the driver key)Example (structure only; may require external executables):
from dpdata.driver import Driver
hyb = Driver.get_driver("hybrid")(
drivers=[
{"type": "sqm", "qm_theory": "DFTB3"},
{"type": "dp", "dp": "frozen_model.pb"},
]
)
# ls = sys.predict(driver=hyb)
dp requires deepmd-kit + a model filegaussian requires Gaussian and a valid executable (default g16)sqm requires AmberTools sqmtools
Create and install dpdata plugins (especially custom Format readers/writers) using Format.register(...) and pyproject.toml entry_points under 'dpdata.plugins'. Use when extending dpdata with new formats or distributing plugins as separate Python packages.
tools
Minimize geometries with dpdata minimizer plugins via System.minimize(), including how minimizers relate to drivers (ASEMinimizer needs a dpdata Driver) and how to list supported minimizers (ase/sqm). Use when doing geometry optimization/minimization through dpdata Python API.
tools
Convert and manipulate atomic simulation data formats using dpdata CLI. Use when converting between DFT/MD output formats (VASP, LAMMPS, QE, CP2K, Gaussian, ABACUS, etc.), preparing training data for DeePMD-kit, or working with DeePMD formats. Supports 50+ formats including deepmd/raw, deepmd/comp, deepmd/npy, deepmd/hdf5.
tools
Use when work should span one or more detached tasks but still behave like one job with a single owner context. TaskFlow is the durable flow substrate under authoring layers like Lobster, ACPX, plugins, or plain code. Keep conditional logic in the caller; use TaskFlow for flow identity, child-task linkage, waiting state, revision-checked mutations, and user-facing emergence.