
markjihwan/community-abtest


ABTest Experiment Platform

abtest is a documentation-centered project for designing and interpreting experiments in the day-to-day operation of community and learning programs.

The project aims for the ideal of a randomized A/B test, but it assumes an operating environment where comparisons between cohorts (successive intakes of the program) are the norm. The document set therefore treats randomized A/B tests and cohort-based comparative experiments as separate cases.

Core Principle

Because randomized A/B tests are hard to run in this operating environment, experiments in this project are evaluated through cohort-level comparison. Completion rate is the North Star Metric, funnel analysis identifies drop-off at each stage, and retention analysis measures sustained participation. The final judgment of effect rests primarily on Bayesian probabilistic interpretation, with Sequential Testing and CUPED used as supporting methods when needed.
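To make the Bayesian reading of completion rate concrete, here is a minimal sketch of a Beta-Binomial estimate of P(T > C) using only the standard library. The function name, the uniform Beta(1, 1) prior, the Monte Carlo approach, and the cohort counts are all assumptions made for illustration; they are not taken from the repository's `bayesian_calc.py`.

```python
import random


def prob_treatment_beats_control(
    completed_t, enrolled_t, completed_c, enrolled_c,
    prior_alpha=1.0, prior_beta=1.0, draws=100_000, seed=0,
):
    """Monte Carlo estimate of P(rate_T > rate_C).

    Each cohort's completion rate gets an independent Beta posterior:
    Beta(prior_alpha + completions, prior_beta + non-completions).
    The Beta(1, 1) default prior is an assumption, not a project rule.
    """
    rng = random.Random(seed)  # seeded so the estimate is reproducible
    a_t = prior_alpha + completed_t
    b_t = prior_beta + (enrolled_t - completed_t)
    a_c = prior_alpha + completed_c
    b_c = prior_beta + (enrolled_c - completed_c)
    wins = sum(
        rng.betavariate(a_t, b_t) > rng.betavariate(a_c, b_c)
        for _ in range(draws)
    )
    return wins / draws


# Hypothetical cohorts: 62 of 100 completed in the new cohort, 50 of 100 before.
p = prob_treatment_beats_control(62, 100, 50, 100)
print(f"P(T > C) = {p:.3f}")
```

Because cohorts are not randomized, a high P(T > C) describes the observed difference between cohorts; it does not by itself rule out differences in cohort composition.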

Start Here

From now on, we recommend reading the documentation as seven core documents plus an appendix.

Core Docs

  • docs/01_FOUNDATIONS.md: introductory bundle covering experiment philosophy, statistics basics, and test design
  • docs/02_EXPERIMENT_POLICY.md: policies for experiment registration, approval, and the use of data, participants, and results
  • docs/03_METRICS.md: metric definitions and KPI priorities
  • docs/04_VALIDITY_AND_TRUST.md: peeking, SRM, novelty, network effect, and quality risks
  • docs/05_ADVANCED_METHODS.md: ratio metrics, multiple testing, variance reduction, sequential testing
  • docs/06_PLATFORM_SCHEMA.md: data schema and design of the statistics columns
  • docs/07_OPERATIONS_AND_DECISIONS.md: operations checklist and final decision criteria
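As an illustration of one validity check named in the list, the sketch below tests for sample ratio mismatch (SRM) with a chi-square goodness-of-fit test on two arms. It applies only where an intended allocation ratio exists, which is the randomized case. The function name and the 0.001 alert threshold are assumptions for this example, not values taken from docs/04_VALIDITY_AND_TRUST.md.

```python
import math


def srm_check(n_control, n_treatment, expected_treatment_share=0.5, alpha=0.001):
    """Chi-square goodness-of-fit test (1 degree of freedom) for SRM.

    Returns (chi2, p_value, flagged). `alpha` is an assumed alert
    threshold; pick one that matches your own policy.
    """
    total = n_control + n_treatment
    expected_t = total * expected_treatment_share
    expected_c = total - expected_t
    chi2 = (
        (n_treatment - expected_t) ** 2 / expected_t
        + (n_control - expected_c) ** 2 / expected_c
    )
    # For 1 degree of freedom, the chi-square survival function
    # reduces to erfc(sqrt(x / 2)), so no external library is needed.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value, p_value < alpha


# Hypothetical counts for an intended 50/50 split.
chi2, p_value, flagged = srm_check(n_control=5000, n_treatment=5400)
print(f"chi2={chi2:.2f}, p={p_value:.2e}, SRM suspected={flagged}")
```

A flagged result suggests checking assignment and logging before reading any metric differences.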

Appendix

  • docs/COMMUNITY_BENCHMARKS.md: external case studies and benchmarks
  • docs/V1_SCOPE_AND_GAPS.md: review of the current scope and v1 priorities
  • docs/REFERENCE_MAP.md: mapping between the earlier detailed documents and the new grouped documents
  • docs/archive/: archive of the detailed documents as they were before revision

Recommended Reading Order

  1. docs/01_FOUNDATIONS.md
  2. docs/02_EXPERIMENT_POLICY.md
  3. docs/03_METRICS.md
  4. docs/04_VALIDITY_AND_TRUST.md
  5. docs/06_PLATFORM_SCHEMA.md
  6. docs/07_OPERATIONS_AND_DECISIONS.md
  7. docs/05_ADVANCED_METHODS.md

Project Structure

community-abtest/
│
├── CLAUDE.md                        ← Claude Code entry point (context + decision principles)
├── .mcp.json                        ← MCP configuration (mounts docs/)
│
├── .claude/
│   ├── agents/
│   │   └── abtest-analyst.md        ← Decision principles + Syneidesis gap tracking
│   └── skills/
│       ├── experiment-register/     ← Experiment registration & approval checklist
│       ├── metrics-definition/      ← Metric definitions & priorities
│       ├── experiment-design/       ← Experiment design workflow
│       ├── validity-check/          ← SRM, peeking, and network effect checks
│       ├── knowledge-audit/         ← Knowledge verification loop (autoresearch pattern)
│       ├── experiment-decision/     ← ship/hold/rollback/need_more_data verdict
│       └── advanced-analysis/       ← CUPED, sequential, ratio metrics
│
├── docs/                            ← Knowledge base mounted via MCP
│   ├── 01_FOUNDATIONS.md
│   ├── 02_EXPERIMENT_POLICY.md
│   ├── 03_METRICS.md
│   ├── 04_VALIDITY_AND_TRUST.md
│   ├── 05_ADVANCED_METHODS.md
│   ├── 06_PLATFORM_SCHEMA.md
│   ├── 07_OPERATIONS_AND_DECISIONS.md
│   ├── SKILL_GUIDE.md               ← Guide to using the Skills
│   └── archive/                     ← Original detailed documents
│
├── scripts/                         ← Deterministic calculation scripts (standard library only)
│   ├── calc_sample_size.py          ← Sample size calculation based on completion rate (Cohen's h)
│   ├── check_balance.py             ← Covariate balance check (SMD)
│   ├── bayesian_calc.py             ← Bayesian P(T>C) calculation (Beta-Binomial)
│   └── stratification_check.py      ← Feasibility of stratified analysis (minimum 20 people per cell)
│
└── experiments/                     ← Store for experiment registration forms
    ├── TEMPLATE.md                  ← Experiment registration form template
    └── 12ki_w7_magical_week.md      ← Cohort 12, W7 Magical Week quasi-experiment
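The tree gives only the name and purpose of each script, so the following is a hedged sketch of three of the calculations it points to: sample size from Cohen's h, the standardized mean difference (SMD), and the 20-per-cell stratification check. All function names, default values (two-sided alpha of 0.05, power of 0.80), and sample data are assumptions for illustration and may differ from what the scripts in scripts/ actually do. Like those scripts, it uses only the standard library.

```python
import math
from collections import Counter
from statistics import NormalDist, mean, variance


def cohens_h(p1, p2):
    """Cohen's h effect size for two proportions (arcsine transform)."""
    return 2 * math.asin(math.sqrt(p1)) - 2 * math.asin(math.sqrt(p2))


def sample_size_per_group(p_baseline, p_target, alpha=0.05, power=0.80):
    """Per-group n for a two-sided two-proportion test, via Cohen's h."""
    h = abs(cohens_h(p_target, p_baseline))
    if h == 0:
        raise ValueError("baseline and target rates must differ")
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    return math.ceil((z / h) ** 2)


def standardized_mean_difference(treatment, control):
    """SMD: difference in means over the pooled standard deviation."""
    pooled_sd = math.sqrt((variance(treatment) + variance(control)) / 2)
    diff = mean(treatment) - mean(control)
    if pooled_sd == 0:
        return 0.0 if diff == 0 else math.copysign(math.inf, diff)
    return diff / pooled_sd


def sparse_cells(assignments, min_per_cell=20):
    """Return the (stratum, arm) cells observed with fewer than min_per_cell members.

    Cells with zero members never appear in `assignments`, so they
    have to be checked separately.
    """
    counts = Counter(assignments)
    return {cell: n for cell, n in counts.items() if n < min_per_cell}


# Hypothetical inputs: completion rate 50% -> 60%, one covariate, one stratum.
print(sample_size_per_group(0.50, 0.60))
print(standardized_mean_difference([2, 3, 4], [1, 2, 3]))
print(sparse_cells([("beginner", "T")] * 25 + [("beginner", "C")] * 12))
```

Stratified analysis would be considered feasible under this sketch only when `sparse_cells` returns an empty dict.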

Claude Code Integration

This repo comes with an Agent and Skills configured so that it can be used together with Claude Code.

Getting Started

The following two tools must be installed:

  • Node.js: runs the MCP server
  • jq: parses JSON in the hook scripts

# Install jq (Windows)
winget install jqlang.jq

# Install jq (Mac)
brew install jq

After that, run claude from this repo's directory and MCP mounts ./docs automatically.

git clone <this-repo>
cd community-abtest
claude

Agent

  • abtest-analyst: an experiment-analysis specialist with the decision principles and the Syneidesis (gap tracking) pattern built in.

Skills

| Skill | Trigger examples |
| --- | --- |
| experiment-register | "What do I need to do before starting an experiment?" |
| metrics-definition | "How should I define the metrics?", "What should I use as guardrails?" |
| experiment-design | "Design an experiment for me", "Calculate the sample size" |
| validity-check | "I suspect SRM", "Can I trust this experiment?" |
| knowledge-audit | "Is this content correct?", "Can this new paper be applied?" |
| experiment-decision | "How should I read the results?", "Is it OK to ship this?" |
| advanced-analysis | "Should I use CUPED?", "Is sequential testing possible?" |

The Skills are chained in order: experiment-register → experiment-design → validity-check → experiment-decision

Experiments

Actual experiment registration forms are stored in the experiments/ folder.

Note

The earlier detailed documents have been moved to docs/archive/ for safekeeping. Going forward, we recommend reading the new grouped documents first and consulting the detailed documents only when needed.
