feat: add 5 new data sources #207
Merged
mingcha-dev merged 1 commit into MLT-OSS:main from May 4, 2026
- china-ncc: National Climate Center of China (CMA)
- global-carbon-project: Global Carbon Project (GCP)
- cdp: Carbon Disclosure Project
- global-reporting-initiative: GRI Standards
- sasb-standards: SASB Standards (IFRS Foundation)
Focus areas: climate disclosure, ESG reporting standards, greenhouse gas accounting.
mingcha-dev (Collaborator) requested changes on May 4, 2026
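明察 QA Review - PR #207 CHANGES REQUESTED 🔴
🔴 Blocking: check-secrecy CI failed
The first line of the PR body references an internal tool name (a term on the confidentiality red-line list), and CI blocked it automatically:
check-secrecy: fail
Please revise the PR body:
- Remove the attribution "based on *** user query analysis"
- Switch to neutral wording, for example: "based on recent user query analysis"
- Or delete that passage entirely and keep only the motivation for the data sources
The contents of the 5 files themselves ✅ contain no confidentiality violations; only the PR body needs fixing.
This is the 3rd secrecy block
History: #188 and #203 were both blocked by the same CI check. Before drafting a PR body, consider running it through a local script that checks the keyword blacklist, so the check is not triggered again.
Please request a re-review once this is fixed 🙏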
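The local pre-check suggested above could look roughly like the following. This is only a sketch: the real blacklist is confidential and is not shown in this PR, so the terms in `BLACKLIST` are placeholders, and the function name `find_blacklisted` is hypothetical rather than anything from the repository.

```python
import re

# Placeholder terms only; the actual red-line keywords are not part of this PR.
BLACKLIST = ["internal-tool-x", "project-secret"]


def find_blacklisted(text, blacklist=BLACKLIST):
    """Return (line_number, term) pairs for each blacklisted term found.

    Matching is case-insensitive and literal (terms are regex-escaped).
    """
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for term in blacklist:
            if re.search(re.escape(term), line, flags=re.IGNORECASE):
                hits.append((lineno, term))
    return hits
```

Running `find_blacklisted` on the draft PR body before opening the PR and refusing to proceed when the result is non-empty would catch the same class of problem that check-secrecy catches in CI.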
mingcha-dev (Collaborator) approved these changes on May 4, 2026
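明察 QA Review - PR #207 APPROVED ✅
Fix confirmed
- ✅ The body now uses neutral wording; check-secrecy is green
- ✅ All four CI checks are green (secrecy / schema / validate / claude-skip)
Checklist
- ✅ Confidentiality (body + contents of the 5 files)
- ✅ ID deduplication (the 5 new IDs are unique across the whole repository)
- ✅ Abbreviation conflict check (high-attention item):
  - china-ncc (National Climate Center) vs. the existing china-nncc (China National Narcotics Control Commission): very similar abbreviations but entirely different institutions, with no overlap in website, authority, or domain; passed
  - cdp / sasb / gri / gcp: no other conflicts
- ✅ Domain-name deduplication
- ✅ URLs and titles all match:
  - china-ncc: 国家气候中心 ✓
  - cdp: CDP: Turning Transparency to Action ✓
  - global-carbon-project: GCP : Global Carbon Project : Homepage ✓
  - global-reporting-initiative: GRI - Home ✓
  - sasb-standards: IFRS - About ✓ (SASB became part of the IFRS Foundation in 2022, when the Value Reporting Foundation was consolidated into it; sasb.ifrs.org is the official path)
- ✅ Domains in kebab-case (3-4 per file)
- ✅ 12-15 tags per file, mixed Chinese and English, no spaces, no garbled characters (grep 201c/201d = 0)
- ✅ International sources go under the top-level international/ directory, consistent with the R4 taxonomy consensus (countries/ + international/)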
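Several of the checklist items above are mechanical and could be scripted. The sketch below shows one possible way to flag exact and near-duplicate IDs (such as china-ncc vs. china-nncc) and to check domain and tag formatting. It is an illustration under assumptions, not the project's actual tooling: the field names `domains` and `tags`, the function names, and the 0.85 similarity cutoff are all hypothetical.

```python
import difflib
import re

# kebab-case: lowercase alphanumeric groups joined by single hyphens.
KEBAB = re.compile(r"^[a-z0-9]+(-[a-z0-9]+)*$")


def check_new_id(new_id, existing_ids, cutoff=0.85):
    """Report whether new_id is an exact duplicate, and list look-alike IDs.

    Look-alikes are not errors; they are candidates for manual review.
    """
    existing = sorted(existing_ids)
    if new_id in existing:
        return {"duplicate": True, "near": []}
    near = difflib.get_close_matches(new_id, existing, n=5, cutoff=cutoff)
    return {"duplicate": False, "near": near}


def check_record(record):
    """Return a list of formatting problems in one source record.

    Assumes (hypothetically) that the record has 'domains' and 'tags' lists.
    """
    problems = []
    for domain in record.get("domains", []):
        if not KEBAB.match(domain):
            problems.append(f"domain not kebab-case: {domain!r}")
    for tag in record.get("tags", []):
        if " " in tag:
            problems.append(f"tag contains a space: {tag!r}")
        if "\u201c" in tag or "\u201d" in tag:
            problems.append(f"tag contains a curly quote: {tag!r}")
    return problems
```

For example, `check_new_id("china-ncc", ["china-nncc", "cdp"])` reports no exact duplicate but lists china-nncc as a near match, which is the kind of case that still needs the human comparison of website, authority, and domain described in the checklist.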
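Coverage value
- china-ncc adds the National Climate Center (under CMA; the authoritative source for climate bulletins)
- GCP adds the global carbon budget
- CDP adds corporate carbon disclosure
- GRI/SASB add ESG standards (the two mainstream frameworks)
Non-blocking suggestion
- This is the 3rd time the secrecy CI has blocked a PR; your commitment to add a pre-PR lint is noted 👍 Once it lands in scripts/, this class of violation should be eliminated
Merge 🚀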
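Overview
This PR adds 5 authoritative data sources focused on climate change and ESG disclosure, including 1 China-priority source.
New data sources
China priority
firstdata/sources/china/national/meteorology/china-ncc.json
International climate
global-carbon-project: Global Carbon Project (GCP)
firstdata/sources/international/climate/global-carbon-project.json
cdp: Carbon Disclosure Project
firstdata/sources/international/climate/cdp.json
ESG / sustainability disclosure standards
global-reporting-initiative: GRI (Global Reporting Initiative)
firstdata/sources/international/standards-metrology/global-reporting-initiative.json
sasb-standards: SASB sustainability accounting standards (IFRS Foundation)
firstdata/sources/international/standards-metrology/sasb-standards.json
Data source rationale
Based on an analysis of recent user queries, users have a clear need for ESG data, carbon emission disclosure, and sustainability reporting standards. The knowledge base currently has little coverage in this area, so adding the authoritative sources above should noticeably improve the hit rate for related queries.
Checklist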
- ... (scripts/check-blacklist.sh)
- ... (make check)
- ... (make check-ids)