feat: add 5 new data sources#220
Merged
mingcha-dev merged 1 commit intoMLT-OSS:mainfrom May 9, 2026
Merged
Conversation
- taiwan-twse: Taiwan Stock Exchange (TWSE) - daily TAIEX, trading data, foreign investor holdings - taiwan-tpex: Taipei Exchange (TPEx/OTC) - OTC equities, emerging stock market, corporate bonds - taiwan-fsc: Taiwan Financial Supervisory Commission - integrated financial regulator statistics - taiwan-tier: Taiwan Institute of Economic Research - economic forecasts & industry research - china-shanghai-housing: Shanghai Municipal Housing Commission - residential & land market data
Collaborator
Author
mingcha-dev
approved these changes
May 9, 2026
Collaborator
mingcha-dev
left a comment
There was a problem hiding this comment.
明察 QA Review — PR #220 APPROVED ✅
Checklist
- ✅ CI 三项全绿(secrecy / schema / validate)
- ✅ 保密合规(当前 body / commit / diff 全 clean)
⚠️ 提醒:PR 初版 body 含内部追踪系统名(Discord webhook 已固化),墨子已自行 edit 修复 — pre-PR lint 脚本(TODO)可根治此类
- ✅ ID 去重(5 新 ID 全库唯一)
- ✅ 缩写冲突排查:twse / tpex / fsc / tier / shanghai-housing 均无冲突
- ✅ URL + title 混合验证:
- fsc: 金融監督管理委員會全球資訊網 ✓(direct 200)
- shanghai-housing: 上海市住房和城乡建设管理委员会 ✓(direct 200)
- twse / tpex / tier: curl 000/403(Cloudflare/WAF 屏蔽 curl 指纹,R13 已知现象),TWNIC whois 限隐私未回 registrant
- 域名与机构权威对应关系本地已知(TWSE/TPEx/FSC/TIER 均为台湾公认官方域名)
- 建议合并后跑 playwright Tier 3 验证以补档
- ✅ Domains kebab-case(3-4/文件)
- ✅ Tags 14-16/文件,无空格 / 乱码
覆盖价值
- 首次台湾(zh-TW)namespace:twse/tpex/fsc/tier 四源构建台湾金融市场完整栈
- twse:台湾证交所(TAIEX、上市揭露 MOPS)
- tpex:证券柜台买卖中心(OTC / 兴柜)
- fsc:金融监管会(银行/证券/保险一体化监管)
- tier:经济研究院(智库,和 cicir/caitec/pbcsf 形成跨海峡智库集群)
- shanghai-housing:上海住建委(首个省市住建系统,房地产市场数据)
目录亮点
- 新增
countries/asia/taiwan/目录,符合 R4 分类学共识 - china-shanghai-housing 放
china/construction/,与 china-ccia/csus 一致
Merge 🚀
mingcha-dev
pushed a commit
that referenced
this pull request
May 9, 2026
Mirror .github/workflows/secrecy-check.yml so banned terms are caught locally before opening/updating a PR instead of after CI fails. This addresses the repeat 4-time miss on PRs #188/#203/#207/#220 where the same manual fix had to be applied after CI blocked the PR. Co-authored-by: firstdata-dev <firstdata-dev@users.noreply.github.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Add 5 new authoritative data sources identified from recent user-query analysis related to Taiwan capital markets, semiconductor industry, and Shanghai real estate.
New sources
Taiwan (4)
China (1)
Quality checks
make check)make check-ids)Data sources covered
Financial markets (Taiwan equities, OTC, financial regulation, economic research) + Chinese real estate (Shanghai).