# Fixed cv2/numpy bug + Better Setup & Usage Instructions + Fixed pointnav_policy_path for Reality (#50)

Closed · wants to merge 9 commits
**README.md** (+60 −14)

## :hammer_and_wrench: Installation

### Getting Started: Environment

Create the conda environment:
```bash
conda_env_name=vlfm
conda create -n $conda_env_name python=3.9 -y &&
conda activate $conda_env_name
```

Check the CUDA version your PyTorch build was compiled against and the CUDA version available on your system, then install the closest matching torch build. *This is important for a successful GroundingDINO installation.*
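
To compare the two versions, something like the following works (the `torch==2.0.1`/`cu118` pairing below is only an illustration; pick the build that matches your system):
```bash
# CUDA version supported by your driver
nvidia-smi

# CUDA toolkit version, if one is installed
nvcc --version

# CUDA version your installed torch build was compiled against
python -c "import torch; print(torch.__version__, torch.version.cuda)"

# Example only: install a torch build compiled for CUDA 11.8
pip install torch==2.0.1 torchvision==0.15.2 --index-url https://download.pytorch.org/whl/cu118
```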

If you are using habitat and are doing simulation experiments, install this repo into your env with the following:
```bash
pip install -e .[habitat]
```

If you are using the Spot robot, install this repo into your env with the following:
```bash
pip install -e .[reality]
```
Clone the following dependencies into the root directory of this repo:
```bash
git clone https://github.com/IDEA-Research/GroundingDINO.git
git clone https://github.com/WongKinYiu/yolov7.git  # if using YOLOv7
git clone https://github.com/facebookresearch/habitat-lab
git clone https://github.com/naokiyokoyama/bd_spot_wrapper.git
```

> **Collaborator:** This is already under pyproject.toml, installable under `pip install -e .[habitat]`.
>
> **Author (@ayushzenith, Oct 4, 2024):** Unfortunately, in both my Windows and Linux environments, even though `pip install -e .[habitat]` ran without any errors, it didn't git clone the repos into the directory, and files from yolov7 and GroundingDINO were required to run the models, from what I remember.
Follow the original install directions for GroundingDINO, which can be found here: https://github.com/IDEA-Research/GroundingDINO.
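
In a typical setup this boils down to the following (a sketch, assuming `CUDA_HOME` points at a CUDA install matching your torch build; see the troubleshooting section below if you use conda-installed CUDA):
```bash
export CUDA_HOME=/usr/local/cuda   # adjust to your CUDA location
cd GroundingDINO
pip install -e .
cd ..
```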

Nothing further needs to be done for YOLOv7 beyond cloning it into the repo.

Follow the original install directions for Habitat-lab, which can be found here: https://github.com/facebookresearch/habitat-lab.

Follow the original install directions for bd_spot_wrapper, which can be found here: https://github.com/naokiyokoyama/bd_spot_wrapper.git.

The following pip installs will also be required:

```bash
pip install hydra-core --upgrade
pip install numba  # Try not to downgrade numpy during this installation if possible; see https://numba.readthedocs.io/en/stable/user/installing.html for more installation instructions.
pip install gym
pip install timm==0.6.12  # Ignore any issues about incompatibility; version >=0.6.12 is needed for the depth model.
```

> **Collaborator:** It would be good to move these to pyproject.toml or the Dockerfile as much as possible.
>
> **Author:** True! If I get the chance I will move them there. Nonetheless, I thought this was useful information in case the requirements in pyproject.toml weren't enough to get your environment ready.
>
> **Collaborator:** Both habitat-lab and bd_spot_wrapper have been in the toml file already.
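
A quick import check (a sketch) to confirm these extras are usable:
```bash
python -c "import hydra, numba, gym, timm; print(timm.__version__)"
```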

#### Troubleshooting: Installing GroundingDINO (Only if using conda-installed CUDA)
Only attempt this if the installation instructions in the GroundingDINO repo do not work.

To install GroundingDINO, you will need `CUDA_HOME` set as an environment variable. If you would like to install a certain version of CUDA that is compatible with the one used to compile your version of pytorch, and you are using conda, you can run the following commands to install CUDA and set `CUDA_HOME`:
```bash
ln -s ${CONDA_PREFIX}/lib/python3.9/site-packages/nvidia/cusolver/include/* ${CONDA_PREFIX}/include/
export CUDA_HOME=${CONDA_PREFIX}
```
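
A quick check that the environment now looks right (a sketch; paths assume the commands above):
```bash
echo $CUDA_HOME                              # should print your conda prefix
ls ${CUDA_HOME}/include | grep -i cusolver   # the symlinked headers should resolve
```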


### :weight_lifting: Downloading weights for various models
The weights for MobileSAM, GroundingDINO, YOLOv7, and PointNav must be saved to the `data/` directory. The weights can be downloaded from the following links:
- `mobile_sam.pt`: https://github.com/ChaoningZhang/MobileSAM
- `groundingdino_swint_ogc.pth`: https://github.com/IDEA-Research/GroundingDINO
- `yolov7-e6e.pt`: https://github.com/WongKinYiu/yolov7
- `pointnav_weights.pth`: included inside the [data](data) subdirectory

```bash
cd data/
wget -q https://github.com/ChaoningZhang/MobileSAM/raw/master/weights/mobile_sam.pt  # use the raw URL; the /blob/ page URL downloads HTML, not the checkpoint
wget -q https://github.com/IDEA-Research/GroundingDINO/releases/download/v0.1.0-alpha/groundingdino_swint_ogc.pth
wget -q https://github.com/WongKinYiu/yolov7/releases/download/v0.1/yolov7-e6e.pt
```
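
A quick sanity check that all the checkpoints are in place:
```bash
ls -lh data/
# expected: groundingdino_swint_ogc.pth, mobile_sam.pt, pointnav_weights.pth, yolov7-e6e.pt
```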


### :dart: Dataset: Downloading the HM3D dataset

### Matterport
First, set the following variables during installation (don't need to put in .bashrc):
```bash
MATTERPORT_TOKEN_ID=<FILL IN FROM YOUR ACCOUNT INFO>
MATTERPORT_TOKEN_SECRET=<FILL IN FROM YOUR ACCOUNT INFO>
DATA_DIR=</path/to/vlfm/data>

# Link to the HM3D ObjectNav episodes dataset
HM3D_OBJECTNAV=https://dl.fbaipublicfiles.com/habitat/data/datasets/objectnav/hm3d/v1/objectnav_hm3d_v1.zip
```

### Make sure that you have cloned and installed *habitat-lab*, then download the datasets
*Ensure that the correct conda environment is activated!!*
```bash
# Download HM3D 3D scans (scenes_dataset)
python -m habitat_sim.utils.datasets_download \
  --username $MATTERPORT_TOKEN_ID --password $MATTERPORT_TOKEN_SECRET \
  --uids hm3d_train_v0.2 \
  --data-path $DATA_DIR &&
python -m habitat_sim.utils.datasets_download \
  --username $MATTERPORT_TOKEN_ID --password $MATTERPORT_TOKEN_SECRET \
  --uids hm3d_val_v0.2 \
  --data-path $DATA_DIR &&

# Download HM3D ObjectNav dataset episodes
wget $HM3D_OBJECTNAV &&
unzip objectnav_hm3d_v1.zip &&
mkdir -p $DATA_DIR/datasets/objectnav/hm3d &&
mv objectnav_hm3d_v1 $DATA_DIR/datasets/objectnav/hm3d/v1 &&
rm objectnav_hm3d_v1.zip
```
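
After this, the episodes should be in place:
```bash
ls $DATA_DIR/datasets/objectnav/hm3d/v1   # should contain the episode splits
```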


## :arrow_forward: Evaluation within Habitat
To run evaluation, various models must be loaded in the background first. This only needs to be done once by running the following command:
```bash
./scripts/launch_vlm_servers.sh
```

Run the following to evaluate on the HM3D dataset:
```bash
python -m vlfm.run
```
To run the evaluation and save the video of the simulator:
```bash
python -m vlfm.run habitat_baselines.eval.video_option='["disk"]'
```
To evaluate on MP3D, run the following:
```bash
python -m vlfm.run habitat.dataset.data_path=data/datasets/objectnav/mp3d/val/val.json.gz
```

## :arrow_forward: Run on Spot
To run the program on Spot:

To change the goal object, edit `env.goal` in `config/experiments/reality.yaml`.
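
A sketch of the relevant snippet (`chair` is only a hypothetical target object; the key path follows the text above):
```yaml
# config/experiments/reality.yaml (sketch)
env:
  goal: "chair"  # set to the object you want Spot to find
```

Then set your Spot credentials and launch: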

```bash
export SPOT_ADMIN_PW=<YOUR_SPOT_ADMIN_PW>
export SPOT_IP=<SPOT_IP>

python -m vlfm.reality.run_bdsw_objnav_env
```


## :newspaper: License

VLFM is released under the [MIT License](LICENSE). This code was produced as part of Naoki Yokoyama's internship at the Boston Dynamics AI Institute in Summer 2023 and is provided "as is" without active maintenance. For questions, please contact [Naoki Yokoyama](http://naoki.io) or [Jiuguang Wang](https://www.robo.guru).
**config/experiments/reality.yaml** (+2 −1)

> **Author:** The incorrect pointnav weights were referenced in the reality config file, assuming the reality config file is the one meant to be used on Spot.

```diff
 policy:
   name: "RealityITMPolicyV2"
-  pointnav_policy_path: "data/pointnav_weights.pth"
+  # pointnav_policy_path: "data/pointnav_weights.pth"
+  pointnav_policy_path: "data/spot_pointnav_weights.pth"
   depth_image_shape: [212, 240]  # height, width
   pointnav_stop_radius: 0.9
   use_max_confidence: False
```
**vlfm/vlm/detections.py** (+1 −1)

> **Author:** This was a cv2 error, fixed by specifying the input int type (`uint8`): `np.random.randint` returns a platform integer (e.g. `int64`) by default, and `cv2.applyColorMap` only accepts 8-bit input. The error is reproducible in habitat and in reality when using the original environment setup instructions as of October 1, 2024, since no specific opencv version is specified and any numpy version after 1.22.4 is accepted.

In `draw_bounding_box`:

```diff
 if color is None:
     # Randomly choose a color from the rainbow colormap (so boxes aren't black)
-    single_pixel = np.array([[np.random.randint(0, 256)]])
+    single_pixel = np.array([[np.random.randint(0, 256, dtype=np.uint8)]])
     single_pixel = cv2.applyColorMap(single_pixel, cv2.COLORMAP_RAINBOW)

     # reshape to a single dimensional array
```
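For reference, a minimal reproduction of the underlying issue (assuming a recent numpy/opencv pairing, as described above):

```python
import cv2
import numpy as np

# Default dtype of np.random.randint is the platform integer (e.g. int64),
# which cv2.applyColorMap rejects: it requires 8-bit input.
bad_pixel = np.array([[np.random.randint(0, 256)]])
# cv2.applyColorMap(bad_pixel, cv2.COLORMAP_RAINBOW)  # -> cv2.error

# Forcing uint8 keeps the array 8-bit, which applyColorMap accepts.
good_pixel = np.array([[np.random.randint(0, 256, dtype=np.uint8)]])
colored = cv2.applyColorMap(good_pixel, cv2.COLORMAP_RAINBOW)
print(colored.shape, colored.dtype)  # (1, 1, 3) uint8
```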
**vlfm/vlm/server_wrapper.py** (+2 −1)

```diff
 def process_request() -> Dict[str, Any]:
     payload = request.json
     return jsonify(model.process_payload(payload))

-app.run(host="localhost", port=port)
+app.run(host="0.0.0.0", port=port)  # app.run(host="localhost", port=port)
```

> **Collaborator:** This is for external access?
>
> **Author:** This opens the Flask server up to the local network. It still allows access via localhost, so no other code needs to change if the models are still hosted on the device connected to Spot. But it also allows the models to be hosted elsewhere on the network, enabling bigger and better models while keeping smaller edge-compute devices on Spot, instead of needing all the compute to be on or near Spot. It also makes it easy to add more models of all sizes if interested.
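
For illustration, a machine elsewhere on the network could then query a served model directly (the IP, port, route, and payload here are hypothetical; the actual payload format is whatever each model's `process_payload` expects):

```python
import requests

# Hypothetical example: 192.168.1.50 and 12182 stand in for the
# server machine's LAN address and the port the model was served on.
response = requests.post(
    "http://192.168.1.50:12182/process",
    json={"txt": "chair"},
)
print(response.json())
```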