skills/43-wentorai-research-plugins/skills/literature/metadata/h-index-guide/SKILL.md
Understanding and calculating research impact metrics
npx skillsauth add brycewang-stanford/Awesome-Agent-Skills-for-Empirical-Research h-index-guideInstall this skill globally with one command. Works with Claude Code, Cursor, and Windsurf.
3 of 9 scanners reported clean
Some scanners were skipped, did not run, or reported a non-clean status. Review each row below.
Understand, calculate, and responsibly interpret bibliometric indicators including h-index, impact factor, and related metrics.
The h-index (Hirsch index) is defined as: a researcher has an h-index of h if h of their papers have each been cited at least h times.
Example: If a researcher has published 20 papers with citation counts [120, 80, 55, 40, 22, 18, 15, 12, 10, 8, 5, 3, 2, 2, 1, 1, 0, 0, 0, 0], their h-index is 10 (10 papers with at least 10 citations each).
def calculate_h_index(citation_counts):
"""Calculate h-index from a list of citation counts."""
sorted_counts = sorted(citation_counts, reverse=True)
h = 0
for i, count in enumerate(sorted_counts):
if count >= i + 1:
h = i + 1
else:
break
return h
# Example
citations = [120, 80, 55, 40, 22, 18, 15, 12, 10, 8, 5, 3, 2, 2, 1, 1, 0, 0, 0, 0]
print(f"h-index: {calculate_h_index(citations)}") # Output: 10
| Metric | Definition | Advantage | |--------|-----------|-----------| | h-index | h papers with >= h citations | Simple, robust to outliers | | i10-index | Number of papers with >= 10 citations | Intuitive threshold (Google Scholar uses this) | | g-index | Largest g such that top g papers have >= g^2 total citations | Rewards highly cited papers more | | m-quotient | h-index divided by years since first publication | Normalizes for career length | | hI-norm | h-index divided by average number of co-authors | Adjusts for team size |
def calculate_g_index(citation_counts):
"""Calculate g-index from citation counts."""
sorted_counts = sorted(citation_counts, reverse=True)
cumulative = 0
g = 0
for i, count in enumerate(sorted_counts):
cumulative += count
if cumulative >= (i + 1) ** 2:
g = i + 1
return g
def calculate_i10_index(citation_counts):
"""Calculate i10-index."""
return sum(1 for c in citation_counts if c >= 10)
print(f"g-index: {calculate_g_index(citations)}") # Output: 19
print(f"i10-index: {calculate_i10_index(citations)}") # Output: 10
Published annually by Clarivate in the Journal Citation Reports (JCR). The 2-year impact factor for year Y is:
JIF(Y) = (Citations in Y to articles published in Y-1 and Y-2)
/ (Number of citable items published in Y-1 and Y-2)
| Metric | Provider | Window | Notable Features | |--------|----------|--------|------------------| | Impact Factor | Clarivate (JCR) | 2-year or 5-year | Gold standard, subscription only | | CiteScore | Scopus (Elsevier) | 4-year | Free, includes all document types | | SJR (Scimago) | Scopus data | 3-year | Weights citations by journal prestige (PageRank-like) | | SNIP | Scopus data | 3-year | Normalizes for citation potential of each field | | h5-index | Google Scholar | 5-year | Free, h-index applied to a journal |
import requests
# Using the OpenAlex API to get journal/source information
journal_name = "Nature"
response = requests.get(
"https://api.openalex.org/sources",
params={"filter": f"display_name.search:{journal_name}", "per_page": 5}
)
results = response.json()["results"]
for source in results:
print(f"Name: {source['display_name']}")
print(f" ISSN: {source.get('issn_l', 'N/A')}")
print(f" Works count: {source.get('works_count', 'N/A')}")
print(f" Cited by count: {source.get('cited_by_count', 'N/A')}")
print(f" h-index: {source.get('summary_stats', {}).get('h_index', 'N/A')}")
print(f" 2-year mean citedness: {source.get('summary_stats', {}).get('2yr_mean_citedness', 'N/A')}")
Google Scholar profiles automatically display h-index and i10-index. No calculation needed, but coverage is the broadest (includes non-peer-reviewed sources).
# OpenAlex provides h-index directly in author profiles
author_name = "Geoffrey Hinton"
response = requests.get(
"https://api.openalex.org/authors",
params={"filter": f"display_name.search:{author_name}", "per_page": 1}
)
author = response.json()["results"][0]
print(f"h-index: {author['summary_stats']['h_index']}")
print(f"i10-index: {author['summary_stats']['i10_index']}")
print(f"2-year mean citedness: {author['summary_stats']['2yr_mean_citedness']}")
The San Francisco Declaration on Research Assessment (DORA) recommends:
tools
Show mcp-stata identity, connected tools, and status. Use when the user asks if mcp-stata is available, asks about access to the toolkit, or asks what Stata tools are connected.
tools
Activate when users mention Stata commands, .do files, regressions, econometrics, stored results, graphs, dataset inspection, replication, or Stata errors. Route the task through mcp-stata tools and the specialized research skills instead of treating it as plain text coding.
development
Build and review paper-ready regression, balance, and summary tables from Stata outputs. Use when the user needs a clean table for a draft, appendix, or coauthor share-out.
tools
Install, configure, update, or verify mcp-stata across Claude Code, Codex, Gemini CLI, Cursor, Windsurf, and VS Code. Activate when users ask to set up the Stata toolkit or troubleshoot the installation.