@mconcas
Collaborator

@mconcas mconcas commented Jan 24, 2025

  • Add processNeighbours GPU kernel and handler
  • Update Detectors/ITSMFT/ITS/tracking/GPU/cuda/CMakeLists.txt
  • Fix second iteration
  • Move the whole processNeighbours on GPU

@mconcas mconcas requested a review from shahor02 as a code owner January 24, 2025 13:09
@github-actions
Contributor

REQUEST FOR PRODUCTION RELEASES:
To request that your PR be included in production software, please add the corresponding labels, prefixed with "async-", to your PR. Add the labels directly (if you have the permissions) or add a comment of the following form (labels are separated by a ","):

+async-label <label1>, <label2>, !<label3> ...

This adds <label1> and <label2> and removes <label3>.

The following labels are available:
async-2023-pbpb-apass4
async-2023-pp-apass4
async-2024-pp-apass1
async-2022-pp-apass7
async-2024-pp-cpass0
async-2024-PbPb-apass1
async-2024-ppRef-apass1

@mconcas mconcas force-pushed the pr_gpu_trackletfinder branch 4 times, most recently from b785d6d to 2e9bcf3 Compare January 24, 2025 15:43
@alibuild
Collaborator

Error while checking build/O2/fullCI for 2e9bcf3 at 2025-01-24 17:59:

## sw/BUILD/O2-latest/log
c++: error: unrecognized command-line option '--rtlib=compiler-rt'
c++: error: unrecognized command-line option '--rtlib=compiler-rt'


## sw/BUILD/o2checkcode-latest/log
--
========== List of errors found ==========
++ GRERR=0
++ grep -v clang-diagnostic-error error-log.txt
++ grep ' error:'
/sw/BUILD/1762a59b37842b7e2c1eda250e9aa393a3daf9d5/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1236:47: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/1762a59b37842b7e2c1eda250e9aa393a3daf9d5/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1243:47: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/1762a59b37842b7e2c1eda250e9aa393a3daf9d5/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1300:49: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/1762a59b37842b7e2c1eda250e9aa393a3daf9d5/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1307:49: error: use nullptr [modernize-use-nullptr]
++ [[ 0 == 0 ]]
++ exit 1
--

Full log here.
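
For readers unfamiliar with the check reported above: clang-tidy's modernize-use-nullptr flags places where a literal `0` or `NULL` is used as a null pointer constant and asks for `nullptr` instead. The flagged lines of TrackingKernels.hip are not shown in this thread, so the snippet below is only a minimal sketch of the pattern; `usableSize` and `emptyCase` are hypothetical names and are not taken from the O2 sources.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical helper (not from TrackingKernels.hip): returns how many
// elements may be processed, treating a null buffer as empty.
inline std::size_t usableSize(const int* buffer, std::size_t n)
{
  // Writing `buffer == 0` or `buffer == NULL` here is the kind of code
  // modernize-use-nullptr reports; `nullptr` is the typed replacement.
  return buffer == nullptr ? 0 : n;
}

// Hypothetical call site: an optional pointer argument is passed as
// `nullptr` rather than as a literal `0`.
inline std::size_t emptyCase()
{
  return usableSize(nullptr, 8);
}

// Small self-check of the sketch above.
inline void checkExample()
{
  int values[3] = {1, 2, 3};
  assert(emptyCase() == 0);
  assert(usableSize(values, 3) == 3);
}
```

The change is behavior-preserving: `nullptr` has its own type (`std::nullptr_t`), so it cannot be mistaken for an integer in overload resolution or template deduction, which is the usual motivation for the check.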

…eO2Group#13816)

Add processNeighbours GPU kernel and handler

Update Detectors/ITSMFT/ITS/tracking/GPU/cuda/CMakeLists.txt

Fix second iteration

Move the whole processNeighbours on GPU
@mconcas mconcas force-pushed the pr_gpu_trackletfinder branch from 2e9bcf3 to 14a816a Compare January 24, 2025 17:01
@alibuild
Collaborator

Error while checking build/O2/fullCI_slc9 for 2e9bcf3 at 2025-01-24 18:13:

## sw/BUILD/O2Physics-latest/log
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:


## sw/BUILD/o2checkcode-latest/log
--
========== List of errors found ==========
++ GRERR=0
++ grep -v clang-diagnostic-error error-log.txt
++ grep ' error:'
/sw/BUILD/c7b780c1fa6ccf955c9b33b881fd76bccc7f041d/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1236:47: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/c7b780c1fa6ccf955c9b33b881fd76bccc7f041d/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1243:47: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/c7b780c1fa6ccf955c9b33b881fd76bccc7f041d/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1300:49: error: use nullptr [modernize-use-nullptr]
/sw/BUILD/c7b780c1fa6ccf955c9b33b881fd76bccc7f041d/O2/Detectors/ITSMFT/ITS/tracking/GPU/hip/TrackingKernels.hip:1307:49: error: use nullptr [modernize-use-nullptr]
++ [[ 0 == 0 ]]
++ exit 1
--

Full log here.

@mconcas mconcas merged commit fb7b17c into AliceO2Group:dev Jan 25, 2025
13 checks passed
@mconcas mconcas deleted the pr_gpu_trackletfinder branch January 25, 2025 14:34
@alibuild
Collaborator

Error while checking build/O2/fullCI_slc9 for 14a816a at 2025-01-25 16:39:

## sw/BUILD/O2Physics-latest/log
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:
Error in cling::AutoLoadingVisitor::InsertIntoAutoLoadingState:


## sw/BUILD/O2-sim-challenge-test-latest/log
./sim-challenge.logDetected critical problem in logfile itsreco.log
./sim-challenge.logitsreco.log:[8890:internal-dpl-ccdb-backend]: [16:39:30][ERROR] Exception while running: Fatal error. Rethrowing.
./sim-challenge.logitsreco.log-[8890:internal-dpl-ccdb-backend]: [16:39:30][FATAL] Unhandled o2::framework::runtime_error reached the top of main of o2-its-reco-workflow, device shutting down. Reason: Fatal error
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:29][ERROR] CCDBDownloader CURL transfer error - Timeout was reached
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:29][ERROR] CcdbDownloader finished transfer http://alice-ccdb.cern.ch/GLO/Config/GRPMagField for 1546300800000 (agent_id: alimetal01.cern.ch-1737819560-IYN2qr) with http code: 0
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:29][ERROR] File GLO/Config/GRPMagField could not be retrieved. No more hosts to try.
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:29][FATAL] Unable to find object GLO/Config/GRPMagField/1546300800000
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:30][ERROR] Exception while running: Fatal error. Rethrowing.
./sim-challenge.log[8890:internal-dpl-ccdb-backend]: [16:39:30][FATAL] Unhandled o2::framework::runtime_error reached the top of main of o2-its-reco-workflow, device shutting down. Reason: Fatal error
./sim-challenge.log[ERROR] Workflow crashed - PID 8890 (internal-dpl-ccdb-backend) did not exit correctly however it's not clear why. Exit code forced to 128.
./sim-challenge.log[ERROR]  - Device internal-dpl-ccdb-backend: pid 8890 (exit 128)
./sim-challenge.log[INFO]    - First error: [16:39:29][FATAL] Unable to find object GLO/Config/GRPMagField/1546300800000
./sim-challenge.log[ERROR] SEVERE: Device internal-dpl-ccdb-backend (8890) had at least one message above severity 5: Unable to find object GLO/Config/GRPMagField/1546300800000
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/37}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/38}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/40}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/42}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/43}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/45}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/46}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/47}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/48}
./digi.log[ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/49}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/37}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/38}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/40}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/42}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/43}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/45}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/46}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/47}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/48}
./digi.log[5401:internal-dpl-ccdb-backend]: [ERROR] Found duplicate input binding with different spec.:collisioncontext {SIM/COLLISIONCONTEXT/49}
[0 more errors; see full log]

Full log here.
