Quickstart¶
This guide will help you get started with NerfBaseline, but it assumes you have already installed the package. If you haven’t, please refer to the installation guide. In this guide, we show the full process of training a Gaussian Splatting model on the Mip-NeRF 360 dataset, garden scene.
Public benchmark results¶
Before training a model, you can inspect the public benchmark to see the results of the implemented methods on standard datasets. In our case, the results of the Gaussian Splatting method can be seen here. If we want to skip the training and only inspect the public checkpoint, we can download the checkpoint from the website and extract it in the current working directory (the next section can be skipped).
Training a model¶
To train a model, you need to specify the method, the dataset, and optionally the backend. The following command will start the training and evaluate on the test set at the end of the training:
nerfbaselines train --method gaussian-splatting --data external://mipnerf360/garden
The training script will try to detect the backend automatically, but if you want to choose a specific backend, you can specify it using the --backend
argument:
nerfbaselines train --method gaussian-splatting --data external://mipnerf360/garden --backend docker
The training script will save all checkpoints, predictions, etc. into the current directory, but you can specify the output directory using the --output
argument.
By default, the training will log all metrics into a tensorboard
logger. You can specify a different logger using the --logger
argument (e.g., --logger wandb
for Weights and Biases) - see supported loggers.
Note
For some official datasets (e.g., Mip-NeRF 360, NerfStudio, Blender, or Tanks and Temples), the datasets will be downloaded automatically if you use external://dataset/scene
as the data argument.
See the (list of datasets) which can be downloaded automatically.
If you want to your own data, point --data
to the folder containing the dataset.
The format of the datasets will be detected automatically.
After the training finished, you should see the following output in the output directory:
checkpoint-30000/
predictions-30000.tar.gz
output.zip
results-30000.json
tensorboard/
The checkpoint-30000
folder contains the trained model, predictions-30000.tar.gz
contains the predictions on the test set, and results-30000.json
contains the evaluation results.
The output.zip
file contains everything in a single archive and is used when uploading the results to the benchmark.
Rendering images¶
To render images from a trained model, you can use the render
command. Let’s extend the previous example (assuming the checkpoint is in the current directory) and evaluate the model on the train set:
nerfbaselines render --checkpoint checkpoint-30000 --data external://mipnerf360/garden --split train --output predictions-train-30000.tar.gz
The predictions-train-30000.tar.gz
file will contain the rendered images and the corresponding camera poses.
Tip
You can choose to output the results in a compressed archive (.tar.gz
) or a folder. The output format is detected automatically based on the file extension.
Note
In all commands working with checkpoint, the method is detected automatically from the checkpoint folder, no need to specify it.
Launching the viewer¶
To further inspect your trained model, you can launch the interactive viewer by running:
nerfbaselines viewer --checkpoint checkpoint-30000 --data external://mipnerf360/garden
The viewer will start in the background and will allow you to connect from your web browser (the URL will be printed in the terminal).
In the viewer, you can inspect the 3D scene, construct a camera trajectory, and render a video of the trajectory. You can also
save the camera trajectory to render it later using the render-trajectory
command. In this walkthrough, we assume you created a camera trajectory
and saved it to a file trajectory.json
.
For information on how to use the viewer, please refer to the Viewer page.
Tip
By specifying the --data
argument, the viewer will use the camera poses to perform gravity alignment and rescaling for a better viewing experience.
It also enables visualizing the input camera frustums and input 3D point cloud. For some datasets (e.g., Photo Tourism datasets), it is required
if you want to visualize the differenct appearance embeddings of train images.
Rendering a camera trajectory¶
To render a camera trajectory (e.g., created using the interactive viewer), use the following command:
nerfbaselines render-trajectory --checkpoint checkpoint-30000 --trajectory trajectory.json --output trajectory.mp4
The format is automatically detected based on the --output
file extension and you can also save individual frames as images by specifying a folder (or a .tar.gz
/.zip
) instead of a file.