Stochasticity makes algorithms more robust, and it does the same for humans.
Hence, I shall embrace the uncertainty.
- Fudan University
- Shanghai, China (UTC +08:00)
- https://chi-shan0707.github.io/
- https://github.com/FDUGuideBook
- https://github.com/wdzdiy-wiki
Highlights
- Pro
Pinned
- TinyLoRA-GRPO-Coder (Public)
  Inspired by "Learning to Reason in 13 parameters", this project uses TinyLoRA + GRPO (32 parameters) to fine-tune Qwen2.5-Coder-3B-Instruct (or other models) for competitive programming.
- code-not-text (Public)
  Cross-domain limits of hand-crafted CoT-surface features: AUROC 0.982 in math, 0.434 in coding. Five methods, one conclusion: code correctness is not in the text.
  Python
- Qwen4Luogu-RL (Public)
  This repo works, but I have made updates in a new repo. See https://github.com/Chi-Shan0707/TinyLoRA-Qwen-Coder for more.
  Python 8
- github-unflag-playbook-cn (Public)
  GitHub Unflag Playbook CN: a self-rescue handbook and record of existence written for developers in mainland China. If this document helps you, would you mind leaving a star~( ̄▽ ̄)~*
  CSS 5




