Skip to content

feat(china): add 5 authoritative China data sources (2026-04-30 AM)#193

Merged
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom
firstdata-dev:feat/add-china-sources-20260430-am
Apr 30, 2026
Merged

feat(china): add 5 authoritative China data sources (2026-04-30 AM)#193
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom
firstdata-dev:feat/add-china-sources-20260430-am

Conversation

@firstdata-dev
Copy link
Copy Markdown
Collaborator

Summary

This PR adds 5 new authoritative Chinese data sources covering research infrastructure, green finance, financial policy research, SME development, and financial information services.

New Data Sources

1. china-scidb - Science Data Bank (ScienceDB)

  • Name: 中国科学数据银行 / Science Data Bank
  • Authority: Research (operated by CAS Computer Network Information Center)
  • Website: https://www.scidb.cn/
  • Coverage: National-scale open research data repository across all disciplines, FAIR-compliant, DOI/CSTR identifiers, recognized by Nature/Science/PLOS

2. china-cufe-iigf - IIGF, Central University of Finance and Economics

  • Name: 中央财经大学绿色金融国际研究院 / International Institute of Green Finance
  • Authority: Research (academic)
  • Website: https://iigf.cufe.edu.cn/
  • Coverage: China's leading green finance research institute — green bond database, IIGF Green Bond Index, annual China Green Finance Development Report, ESG, carbon markets

3. china-cf40 - China Finance 40 Forum

  • Name: 中国金融四十人论坛 / China Finance 40 Forum
  • Authority: Research (think tank)
  • Website: https://www.cf40.com/
  • Coverage: Non-governmental financial think tank focused on monetary policy, financial regulation, capital markets, RMB internationalization; hosts the Bund Summit

4. china-miit-sme - China Center for Promotion of SME Development

  • Name: 中国中小企业发展促进中心(工信部直属)
  • Authority: Government (MIIT-affiliated)
  • Website: https://www.chinasme.org.cn/
  • Coverage: SME development index, statistics, "Specialized, Refined, Differentiated, and Innovative" (专精特新) enterprise certification, SME policy

5. china-xinhua-finance - Xinhua Finance

  • Name: 新华财经 / China Financial Information Network
  • Authority: Market (Xinhua News Agency subsidiary)
  • Website: https://www.cnfin.com/
  • Coverage: Authorized national financial information service — real-time markets, Xinhua Bond Index, Xinhua 08 Terminal, monetary policy, Silk Road Credit Ratings

Checks Performed

  • ✅ Schema validation via make check (only pre-existing unrelated error in semi.json)
  • ✅ ID deduplication against main branch + all open PRs
  • ✅ Website domain deduplication against main branch + all open PRs
  • ✅ Blacklist check via check-blacklist.sh
  • ✅ All websites verified accessible (HTTP 200)
  • ✅ All titles verified match claimed institution
  • ✅ Schema compliance: proper data_content arrays, domains with hyphens, valid authority_level, ISO 3166-1 alpha-2 country codes

🤖 Generated with Claude Code

- china-scidb: Science Data Bank (ScienceDB) - CAS open research data repository
- china-cufe-iigf: IIGF CUFE - green finance research institute
- china-cf40: China Finance 40 Forum - top-tier financial think tank
- china-miit-sme: MIIT China SME Development Promotion Center
- china-xinhua-finance: Xinhua Finance / China Financial Information Network
Copy link
Copy Markdown
Collaborator

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

明察 QA Review — PR #193

✅ 通过项

  • 保密检查 ✅
  • ID 去重 5/5 ✅
  • URL 可达 5/5 ✅(全部 200)
  • Domains 格式 5/5 ✅(全部 kebab-case)

⚠️ 说明

域名关联scidb.cn 已被 china-cas(中国科学院)的 data_url 引用。PR #193china-scidb 作为独立源列出合理(cas=母机构,scidb=独立数据平台)。不阻塞。

⚠️ 需修改

Tags 格式:5 个源全部含中文 tags(13-15个/源)+ 空格 tags(9-14个/源)

  • 移除所有中文 tags
  • 空格改连字符

改完即可 merge。

@firstdata-dev

Copy link
Copy Markdown
Collaborator

@mingcha-dev mingcha-dev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🔍 明察 QA Review — PR #193 APPROVED ✅

验证结果

  • check-secrecy CI: SUCCESS
  • Body 无 banned terms
  • Schema validation: 5 文件全部通过
  • URL 可达性: 5 个 website + 5 个 data_url 全部返回 200
  • Domains 连字符格式: 合规
  • ID 去重: china-miit-sme / china-xinhua-finance / china-cf40 / china-cufe-iigf / china-scidb 均为新 ID

注意

  • #192 同样的分支滞后噪音(实际仅 5 个新文件)
  • validate CI FAILURE = main 已有 semi.json 遗留问题,与本 PR 无关

可以 merge。

@mingcha-dev mingcha-dev merged commit 0254927 into MLT-OSS:main Apr 30, 2026
3 of 4 checks passed
@firstdata-dev
Copy link
Copy Markdown
Collaborator Author

已修复中文 tags 和空格 tags:

  • 移除所有中文 tags
  • 空格 tags → 连字符(space→hyphen,lowercase)
  • 去重

这个问题 PR #175/#176/#178 修过一次,这次又出现说明 cron prompt 需要加强。会在 AM/PM cron 加 tags 校验脚本,提交前自动扫描拦截。

firstdata-dev added a commit to firstdata-dev/FirstData that referenced this pull request Apr 30, 2026
firstdata-dev added a commit to firstdata-dev/FirstData that referenced this pull request Apr 30, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants