Skip to content

cuIpcGetMemHandle triggered CUDA out of memory when I use flexflow on one gpu  #75

@spacecat2002

Description

@spacecat2002

I used docker image "flexflow/flexflow-cuda-12.1:latest" to run flexflow on a 24GB RTX 3090,but it generated a out of memory error:

import flexflow.serve as ff 
ff.init(num_gpus=1, memory_per_gpu=11000, zero_copy_memory_per_node=11000)
[0 - 7fe1339374c0]    0.000000 {5}{gpu}: /usr/FlexFlow/deps/legion/runtime/realm/cuda/cuda_module.cc(4745):CUDA_DRIVER_FNPTR(cuIpcGetMemHandle)(&alloc.ipc_handle, alloc.dev_ptr) = 2(CUDA_ERROR_OUT_OF_MEMORY): out of memory
Aborted

Was it because I used the wrong code? How can I fix it?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions