support replay buffer save and resume, save_hf in trainer#1592
support replay buffer save and resume, save_hf in trainer#1592YanhuiDua merged 4 commits intoInternLM:rl_designfrom
Conversation
|
@claude review |
|
Claude: ## Summary Adds IssuesWarning
Nit
VerdictCOMMENT — no critical bugs found. The core logic is correct and tests are solid. Please address the blocking I/O and docstring issues before merge. |
5c933c6 to
c918dba
Compare
|
@claude review |
| @@ -331,6 +376,38 @@ async def count(self, task_name: str, group_status: Status) -> int: | |||
| def __len__(self) -> int: | |||
There was a problem hiding this comment.
Claude: Nit — Missing docstrings on public methods
Per project standards (CLAUDE.md), public methods must have Google-style docstrings. save, resume, state_dict, and load_state_dict are all public and undocumented.
|
Claude: ## Summary Adds save/resume checkpointing for IssuesCritical
Warning
Nit
VerdictREQUEST_CHANGES — the tuple unpacking bug silently disables HF path validation and must be fixed before merge. |
c918dba to
c18b4f2
Compare
|
@claude review |
6776063 to
ba549d7
Compare
ba549d7 to
24c8789
Compare
No description provided.