-
Notifications
You must be signed in to change notification settings - Fork 59
[Refactor] Habitat Refactor & Support Distributed VLN-PE Evaluation #168
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…aac sim gpu setting
…d on lmdb path + rank; remove TODOs and unused files
scripts/eval/eval.py
Outdated
| from internnav.configs.evaluator.vln_default_config import get_config | ||
|
|
||
| evaluator_cfg = get_config(evaluator_cfg) | ||
| elif evaluator_cfg.eval_type == 'habitat_vln': |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个是干什么的,habitat vln
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个flag没有用到,我去掉
scripts/eval/eval.py
Outdated
| default='scripts/eval/configs/h1_rdp_cfg.py', | ||
| help='eval config file path, e.g. scripts/eval/configs/h1_cma_cfg.py', | ||
| ) | ||
| parser.add_argument('--port', type=int, default=None) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
我看server里面写了默认的host和port,这里也写一下会不会好一些,方便跑
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
好的,已在AgentConfig里面加了默认的host和port
…tics#176) * [file] Update 3d printing files for camera of Unitree Go2. * [Doc] Update readme for 3d printing models.
* update vln yaml; fix import agent * update habitat, using evaluator and config; env and agent is WIP * add distributed_base evaluator * Habitat env applied, distributed evaluator applied; clean evaluator and agent created * fix observation issues * update new register name; tiny fix on style * latest tested * delete temp agent; rename default evaluator for habitat * update slurm bash * update readme * fix init dist print * fix eval config; fix local rank to rank * update init distributed mode if condition * update dist for dlc * fix bug in evaluator * update distributed vln multi, episode loader * cma tested * cma tested; episode loader; torchrun; local agent tested; TODO fix isaac sim gpu setting * add vlnpe distributed script * add grscene; add new result write and resumble load data feature based on lmdb path + rank; remove TODOs and unused files * fix bugs in evaluator and dataset for distributed; n1 and rdp tested * fix comm log concurrency issue, create file with exist true * fix progress log mkdir race condition * fix comments * polish existing configs and bash * update bash align with doc * fix CI test * rename habitat_extensions * fix comments * remove useless line * [file] Update 3D Printing Files for Camera of Unitree Go2 (InternRobotics#176) * [file] Update 3d printing files for camera of Unitree Go2. * [Doc] Update readme for 3d printing models. * fix visualize image size; fix unused comment code; fixed evaluator name * bump to version v0.0.2 * update vlnmulti to VLN; update habitatVln to HabitatVLN * :Revert "Merge branch 'main' into vlnpe_refactor" This reverts commit 7a23b72, reversing changes made to ea73075. --------- Co-authored-by: Yuqiang Yang <49871992+yuqiang-yang@users.noreply.github.com>
PR Summary
reset/step, episode assignment, and episode reloading inside the worker, significantly streamlining the evaluation pipeline.Distributed Eval Time
Using 16 nodes, each with 1× RTX 4090 GPU, 8 CPUs, and 60 GB RAM.
Performance after Refactor
InternVLA-N1InternVLA-N1InternVLA-N1RDPRDP