Temporary repo for the development of a depth estimation tool for AddaxAI.
It will be incorporated into the existing AddaxAI Streamlit repo eventually, but it's easier for now to treat it as a separate tool. We'll tie it together later.
- Clone the repository:

  ```bash
  git clone https://github.com/PetervanLunteren/st-depth-estimation.git
  ```

- Install the dependencies:

  ```bash
  pip install -r st-depth-estimation/requirements.txt
  ```

To run the app, use the following command:

```bash
streamlit run st-depth-estimation/app.py
```

The idea is that users can add an estimate of the distance from the camera to the animal, which in turn provides information on the size of the animal. By the time users reach this tool, they have already processed the images with AI recognition models, so the bounding boxes and predictions for the animals are already known.
My idea was to run the Depth Anything model (https://github.com/DepthAnything/Depth-Anything-V2) over one image of the deployment, calibrate it with a few known points, and then be able to calculate the distance to each animal.
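For reference, a minimal sketch of the depth-inference step, assuming the Hugging Face `transformers` depth-estimation pipeline and the `depth-anything/Depth-Anything-V2-Small-hf` checkpoint are used (the tool may just as well call the native Depth-Anything-V2 code from the repo linked above):

```python
# Minimal sketch: get a relative depth map for one deployment image.
from PIL import Image
from transformers import pipeline

pipe = pipeline(task="depth-estimation", model="depth-anything/Depth-Anything-V2-Small-hf")

image = Image.open("test-imgs/img_0001.jpg")
result = pipe(image)

# `result["depth"]` is a PIL image suitable for showing next to the RGB image;
# `result["predicted_depth"]` is the raw (relative, unitless) depth tensor.
depth_vis = result["depth"]
depth_raw = result["predicted_depth"].squeeze().numpy()
# Note: `depth_raw` may come out at model resolution; resize it to the original
# image size before indexing it with pixel coordinates.
```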
You can find an example deployment in this repo under `/test-imgs/`. In that folder there is a CSV file with all the prediction and bbox information: `/test-imgs/results_detections.csv`.
Example detections CSV:
| relative_path | label | confidence | bbox_left | bbox_top | bbox_right | bbox_bottom | DateTimeOriginal | Latitude | Longitude |
|---|---|---|---|---|---|---|---|---|---|
| img_0001.jpg | species Alcelaphus buselaphus | 0.92241 | 558 | 398 | 941 | 654 | 18/01/2013 08:58 | 0.27805552777777776 | 36.87395458333334 |
| img_0001.jpg | family Bovidae | 0.94621 | 1058 | 468 | 1227 | 574 | 18/01/2013 08:58 | 0.27805552777777776 | 36.87395458333334 |
| img_0002.jpg | species Alcelaphus buselaphus | 0.97218 | 552 | 397 | 919 | 652 | 18/01/2013 08:58 | 0.27805552777777776 | 36.87395458333334 |
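For reference, loading this CSV and computing a candidate ground-level point per detection takes only a few lines of pandas (column names as in the table above; `bbox_center_x` / `bbox_bottom_y` are names made up for this sketch):

```python
import pandas as pd

# Load the example detections for one deployment.
detections = pd.read_csv("test-imgs/results_detections.csv")

# Bottom-center of each bounding box in pixel coordinates: a candidate point
# at which to sample the calibrated depth map, since it sits roughly at
# ground level underneath the animal.
detections["bbox_center_x"] = (detections["bbox_left"] + detections["bbox_right"]) / 2
detections["bbox_bottom_y"] = detections["bbox_bottom"]
```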
I don't really have a concrete idea of how it should look, but I believe the following should be present:
- Let the user select a representative image to do the calibration. Perhaps using https://github.com/jrieke/streamlit-image-select?
- Run the Depth Anything model (https://github.com/DepthAnything/Depth-Anything-V2) over that image and get its depth map. It would be nice to show the RGB image next to the coloured depth map. Perhaps something like https://huggingface.co/spaces/depth-anything/Depth-Anything-V2?
- Do the actual calibration, where the user clicks on a few objects and inputs the known distance. E.g., this tree is 3 meters away, this bush is 10 meters away, etc. How many points are needed? I don't know; we'll have to test. It would be nice to be able to click on either the RGB image or the depth map, whichever the user prefers, and have the calibration point show up on both images. Perhaps using https://image-coordinates.streamlit.app/dynamic_update?
- Calculate the distances of all the bounding boxes of the animals in the other images in that deployment, so we can add them to the results. We can do different things here, like taking the distance at the center of the bounding box (perhaps not useful for oddly shaped animals that don't always have a body in the center, like snakes, giraffes, etc.). It may be best to take the center of the bottom edge of the bbox, so that we have the distance at ground level. This is also error-prone due to grass or other objects being in the way, so we'll have to do some trial and error. We can also calculate the height of the animal, which gives the user some information on the age group of the animal (juvenile / adult). A rough sketch of this calibration and distance calculation is shown after this list.
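To make the last two bullets concrete, here is a rough sketch of one possible calibration scheme. It assumes the relative depth map is disparity-like (larger values = closer, as Depth Anything outputs), that a linear fit in inverse-distance space is good enough, and that the camera does not move within the deployment, so the calibrated map of the representative image can be reused for all detections. The helper names `fit_calibration` and `pixel_to_meters` are made up for this sketch:

```python
import numpy as np

def fit_calibration(depth_map, points):
    """Fit a mapping from relative depth to metric distance.

    `points` is a list of (px_x, px_y, distance_m) calibration clicks on the
    representative image. Since the relative depth is assumed disparity-like,
    we fit 1/distance = a * rel_depth + b by least squares (needs >= 2 points).
    """
    rel = np.array([depth_map[int(y), int(x)] for x, y, _ in points])
    inv_dist = np.array([1.0 / d for _, _, d in points])
    a, b = np.polyfit(rel, inv_dist, deg=1)
    return a, b

def pixel_to_meters(depth_map, a, b, px_x, px_y):
    """Estimated distance in meters at one pixel of the calibrated depth map."""
    inv_dist = a * depth_map[int(px_y), int(px_x)] + b
    return float(1.0 / inv_dist) if inv_dist > 0 else float("nan")

# Example usage with the objects from the earlier sketches:
#   points = [(612, 480, 3.0), (1100, 350, 10.0)]   # e.g. tree at 3 m, bush at 10 m
#   a, b = fit_calibration(depth_raw, points)
#   detections["animal_distance_m"] = [
#       pixel_to_meters(depth_raw, a, b, x, y)
#       for x, y in zip(detections["bbox_center_x"], detections["bbox_bottom_y"])
#   ]
```

Whether two points are enough, and whether the fit should be done in inverse-distance or log-distance space, is exactly the kind of thing that will need the trial and error mentioned above.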
This is just an initial idea! Please don't take it too strictly. In my experience it always turns out differently than you would expect, so please feel free to play around and try out what feels best!
Using the image selector, select an image in which you know the distance to a reference point in meters, for example a tree or fence post. Once an image is selected, scroll down and click on the object whose distance you know; this passes the pixel coordinates. After clicking the point, enter the distance in meters in the box below. This adds `distance_m`, `px_x`, and `px_y` columns to `results_detections.csv`. Currently the page must be reloaded to see the additions to the predictions CSV file. Note: this modifies `results_detections.csv` in place.
The point of this is to get a calibration point that can be passed to the model to calculate distance. Multiple points can be added by clicking on different images and objects.
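As an illustration of how this could be wired together in Streamlit, here is a hedged sketch using `streamlit-image-select` and `streamlit-image-coordinates` (the two packages linked above); the widget calls follow their documented APIs, but the paths and the way the point is written back into `results_detections.csv` are assumptions, not a description of the current app:

```python
from pathlib import Path

import pandas as pd
import streamlit as st
from streamlit_image_select import image_select
from streamlit_image_coordinates import streamlit_image_coordinates

CSV_PATH = Path("test-imgs/results_detections.csv")
IMG_DIR = Path("test-imgs")

# 1. Let the user pick a representative image of the deployment.
images = sorted(str(p) for p in IMG_DIR.glob("*.jpg"))
selected = image_select("Select a calibration image", images)

# 2. Capture a click on that image; the widget returns pixel coordinates
#    (note: these are in displayed-image pixels, so rescale them if the
#    image is shown at a different size than the original).
coords = streamlit_image_coordinates(selected, key="calib_click")

# 3. Ask for the known distance to the clicked object and write it back.
if coords is not None:
    distance_m = st.number_input("Distance to the clicked object (meters)", min_value=0.0, step=0.5)
    if st.button("Save calibration point"):
        df = pd.read_csv(CSV_PATH)
        for col in ("px_x", "px_y", "distance_m"):
            if col not in df.columns:
                df[col] = pd.NA
        mask = df["relative_path"] == Path(selected).name
        df.loc[mask, ["px_x", "px_y", "distance_m"]] = (coords["x"], coords["y"], distance_m)
        df.to_csv(CSV_PATH, index=False)
        st.success(f"Saved point ({coords['x']}, {coords['y']}) at {distance_m} m")
```

Storing the calibration point directly in the detections CSV mirrors the current behaviour described above; a separate calibration file per deployment might end up being cleaner once multiple points per image are supported.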
