
rgbdvslam

Feature-based visual simultaneous localization and mapping (vSLAM) and visual-inertial sensor fusion with RGB-D camera

Since R2025a

    Description

    Use the rgbdvslam object to perform visual simultaneous localization and mapping (vSLAM) with RGB-D camera data. RGB-D vSLAM combines depth information from sensors, such as RGB-D cameras or depth sensors, with RGB images to simultaneously estimate the camera pose and create a map of the environment. To learn more about visual SLAM, see Implement Visual SLAM in MATLAB (Computer Vision Toolbox).

    The rgbdvslam object extracts Oriented FAST and Rotated BRIEF (ORB) features from incrementally read images, and then tracks those features to estimate camera poses, identify key frames, and reconstruct a 3-D environment. The vSLAM algorithm also searches for loop closures using the bag-of-features algorithm, and then optimizes the camera poses using pose graph optimization. You can enhance the accuracy and robustness of the SLAM estimates by integrating IMU data with this object to perform visual-inertial sensor fusion.

    Creation

    Description

    vslam = rgbdvslam(intrinsics) creates an RGB-D visual SLAM object, vslam, by using the specified camera intrinsic parameters.

    The rgbdvslam object assumes the color and the depth images have been preregistered with one-to-one correspondence.

    The object represents 3-D map points and camera poses in world coordinates, and assumes the camera pose of the first key frame is an identity rigidtform3d (Image Processing Toolbox) transform.

    Note

    The rgbdvslam object runs on multiple threads internally, which can delay the processing of an image frame added by using the addFrame function. Additionally, because the object runs on multiple threads, the frame it is currently processing can differ from the most recently added frame.

    vslam = rgbdvslam(intrinsics,depthScaleFactor) specifies the depth scale factor of the RGB-D camera, which the camera manufacturer usually provides. Use this syntax when the depth scale factor for the sensor is not equal to 1.

    vslam = rgbdvslam(___,imuParameters) performs RGB-D visual-inertial SLAM based on the specified imuParameters.

    vslam = rgbdvslam(intrinsics,PropertyName=Value) sets properties using one or more name-value arguments. For example, MaxNumPoints=850 sets the maximum number of ORB feature points to extract from each image to 850.
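
    For example, a minimal construction sketch; the intrinsics values are borrowed from the first example below, and the depth scale factor and name-value choices are illustrative, not recommended settings.

    % Construction sketch: the intrinsics values are taken from the first example
    % below; the depth scale factor and property values are illustrative.
    intrinsics = cameraIntrinsics([535.4 539.2],[320.1 247.6],[480 640]);
    vslam = rgbdvslam(intrinsics,5000,MaxNumPoints=850,ScaleFactor=1.2);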


    Input Arguments


    Camera intrinsic parameters, specified as a cameraIntrinsics (Computer Vision Toolbox) object.

    This argument sets the Intrinsics property.

    Depth scale factor, specified as a scalar in real-world units, such as meters. The depth scale factor is the conversion factor that relates the depth values of the depth sensor to real-world distances, and is typically expressed in the same units as the depth measurements provided by the sensor, such as millimeters, centimeters, or meters. This value provides the necessary information to transform the depth measurements into the metric scale. Use the depthScaleFactor argument when the value for the sensor you are using is not equal to 1.

    For the world 3-D coordinates (X, Y, Z), where Z is the depth at any pixel coordinate (u, v), Z = P/depthScaleFactor, where P represents the intensity value of the depth image at pixel (u, v).

    This argument sets the DepthScaleFactor property.
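
    For instance, a short sketch of this conversion, assuming a 16-bit depth image and a scale factor of 5000 counts per meter (the file name is hypothetical):

    % Depth-to-metric conversion sketch. The file name is hypothetical; the scale
    % factor of 5000 counts per meter matches the TUM RGB-D data used in the
    % first example below.
    depthScaleFactor = 5000;
    rawDepth = imread("depth_0001.png");          % 16-bit depth counts (hypothetical file)
    Zmeters = double(rawDepth)/depthScaleFactor;  % Z = P/depthScaleFactor, in meters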

    IMU parameters, specified as a factorIMUParameters object. The object contains the noise, bias, and sample rate information about the inertial measurement unit (IMU).

    Properties


    Camera Parameters

    This property is read-only.

    Camera intrinsic parameters, stored as a cameraIntrinsics (Computer Vision Toolbox) object.

    Use the intrinsics argument to set this property.

    This property is read-only after object creation.

    Depth scale factor, specified as a scalar in real-world units (e.g., meters). The depth scale factor depends on the depth sensor, and it is typically expressed in the same unit as the depth measurements provided by the sensor (e.g., millimeters, centimeters, or meters). It is the conversion factor that relates the depth values to real-world distances. In other words, it provides the necessary information to transform the depth measurements into metric scale.

    For the world 3-D coordinates (X,Y,Z), where Z represents the depth at any pixel coordinate (u,v), Z = P/depthScaleFactor, where P represents the intensity value of the depth image at pixel (u,v).

    Use the depthScaleFactor argument to set this property.

    This property is read-only after object creation.

    Depth range of the RGB-D camera, specified as a two-element vector, in world units. The range specifies the minimum and maximum depth values of the camera, which you can use to filter out invalid depth values in the depth images.
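
    As an illustration, a sketch of the filtering this range implies, assuming the metric depth image Zmeters from the conversion sketch above and an illustrative range of [0.1 6.5] meters:

    % Depth filtering sketch: depth values outside DepthRange are treated as
    % invalid. Zmeters and the range values are assumptions for illustration.
    depthRange = [0.1 6.5];
    validDepth = Zmeters >= depthRange(1) & Zmeters <= depthRange(2);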

    Feature Extraction

    This property is read-only after object creation.

    Scale factor for image decomposition, stored as a scalar greater than 1. The scale factor is also referred to as the pyramid decimation ratio. Increasing the value of ScaleFactor reduces the number of pyramid levels, which reduces computation time. Decreasing this value (down to just over 1) increases the number of pyramid levels, which can improve tracking performance at the cost of computation speed. The scale value at each level of decomposition is ScaleFactor^(level-1), where level is any value in the range [0, NumLevels-1]. Given an input image of size M-by-N, the image size at each level of decomposition is Mlevel-by-Nlevel, where:

    Mlevel = M/ScaleFactor^(level-1)
    Nlevel = N/ScaleFactor^(level-1)

    This property is read-only after object creation.

    Number of decomposition levels, specified as an integer greater than or equal to 1. Increase this value to extract keypoints from the image at more levels of decomposition. Along with the ScaleFactor value, NumLevels controls the number of pyramid levels on which the object evaluates feature points.

    The image size at each decomposition level limits the number of levels at which you can extract keypoints. The image size at a level of decomposition must be at least 63-by-63 for keypoint detection. The maximum level of decomposition is calculated as

    levelmax = floor((log(min(M,N)) - log(63))/log(ScaleFactor)) + 1

    If either the default value or the specified value of NumLevels is greater than levelmax, the object modifies NumLevels to levelmax and returns a warning.
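
    For example, a sketch of this computation for an illustrative 480-by-640 image and a scale factor of 1.2:

    % Maximum decomposition level sketch: for these illustrative values, a
    % 480-by-640 image with ScaleFactor = 1.2 gives levelmax = 12.
    M = 480; N = 640; scaleFactor = 1.2;
    levelMax = floor((log(min(M,N)) - log(63))/log(scaleFactor)) + 1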

    This property is read-only after object creation.

    Maximum number of ORB feature points uniformly extracted from each image, specified as a positive integer. Values are typically in the range of [800, 2000], depending on the resolution of the image. When the number of extracted features is less than the value of MaxNumPoints, then the object uses all feature points.

    Tracking

    This property is read-only after object creation.

    Key frame feature point range, stored as a two-element vector of positive integers in the form [lowerLimit upperLimit]. This property specifies the minimum and maximum numbers of tracked feature points a frame must contain for the object to identify it as a key frame. The TrackFeatureRange and SkipMaxFrames properties enable you to control the frequency at which frames in the tracking process become key frames.

    The success of tracking depends on the number of tracked points in the current frame, with one of these results:

    • Tracking is lost — The number of tracked feature points in the current frame is less than the lowerLimit set by the TrackFeatureRange argument. This indicates that the image does not contain enough features, or that the camera is moving too fast. To improve the tracking, you can increase the upperLimit value of the TrackFeatureRange property and decrease the SkipMaxFrames property to add key frames more frequently.

    • Tracking is successful — The object identifies the current frame as a key frame. The number of tracked feature points in the current frame is in the range set by TrackFeatureRange.

    • Tracking adds key frames too frequently — The number of tracked feature points in the current frame is greater than the upperLimit set by the TrackFeatureRange property. This indicates that the camera is moving very slowly, which produces an unnecessary number of key frames. To improve the tracking, you can reduce the frequency of adding key frames by increasing the value of the SkipMaxFrames property.

    For more details, see the addFrame object function.

    This property is read-only after object creation.

    Maximum number of skipped frames, stored as a positive integer. When the number of tracked features is consistently greater than the upperLimit set by the TrackFeatureRange property, use the SkipMaxFrames property to control the frequency at which the object adds new key frames. The object identifies the current frame as a key frame when the number of skipped frames since the most recently added key frame equals the value of SkipMaxFrames.
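
    For instance, a sketch of setting both key-frame controls at construction; the intrinsics and depthScaleFactor variables are assumed to exist, and the values are illustrative.

    % Key-frame tuning sketch: intrinsics and depthScaleFactor are assumed to
    % exist already; the range and skip values are illustrative.
    vslam = rgbdvslam(intrinsics,depthScaleFactor, ...
        TrackFeatureRange=[30 120],SkipMaxFrames=20);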

    This property is read-only after object creation.

    Minimum number of matched feature points between loop closure key frames, stored as a positive integer.

    This property is read-only after object creation.

    Custom bag of features for loop detection, specified as a bagOfFeaturesDBoW (Computer Vision Toolbox) object. The bagOfFeaturesDBoW object enables you to create a custom bag of visual words from feature descriptors, with options to use a built-in vocabulary or to load a custom vocabulary from a specified file.

    This property is read-only after object creation.

    Progress information display, specified as [], 1, 2, or 3. When log files are created, their paths are displayed on the command line.

    Verbose value | Display description | Display location
    [] or false   | Display is turned off |
    1 or true     | Stages of vSLAM execution | Command window
    2             | Stages of vSLAM execution, with details on how the frame is processed, such as the artifacts used to initialize the map | Log file in a temporary folder
    3             | Stages of vSLAM execution, artifacts used to initialize the map, poses and map points before and after bundle adjustment, and loop closure optimization data | Log file in a temporary folder
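
    For example, a sketch that enables detailed per-frame logging, assuming intrinsics and depthScaleFactor already exist:

    % Verbose logging sketch: level 2 writes per-frame processing details to a
    % log file in a temporary folder.
    vslam = rgbdvslam(intrinsics,depthScaleFactor,Verbose=2);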

    IMU Fusion

    This property is read-only after object creation.

    IMU sensor transformation, specified as a rigidtform3d (Image Processing Toolbox) object. The transformation describes the rotation and translation of the camera in the coordinate system of the IMU sensor.
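
    For instance, a sketch of such a transform with illustrative rotation and translation values, not calibration results:

    % Camera-to-IMU transform sketch: a 180-degree rotation about the x-axis and
    % a 2 cm translation are illustrative values only.
    cam2IMU = rigidtform3d([180 0 0],[0.02 0 0]);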

    This property is read-only after object creation.

    Number of estimated camera poses to trigger camera-IMU alignment, specified as an integer equal to or greater than 2. The process of aligning camera and IMU data is initiated after a specific number of camera poses have been estimated. This alignment serves two primary purposes: first, to estimate a scale factor that translates the up-to-scale results from a monocular camera into actual world units (meters), and second, to synchronize the IMU and camera frames, effectively eliminating the influence of gravity on accelerometer data. The timing for this alignment, determined by a threshold for the number of camera poses, is key to its success. A threshold set too low may not provide enough data for accurate calibration, while one set too high risks incorporating noise from measurement drift into the calibration. For a more in-depth understanding of this calibration technique, see the estimateGravityRotationAndPoseScale function.

    This property is read-only after object creation.

    Subset of pose estimates, specified as a scalar in the range of (0,1]. This value specifies a fraction of the number of recent pose estimates, calculated as round(NumPosesThreshold*AlignmentFraction), for use in the camera-IMU alignment process. It effectively filters out initial, potentially noisy pose estimates, ensuring only the most relevant data contributes to the alignment for improved accuracy.
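
    As a sketch, the IMU-related properties can be set together at construction. The intrinsics, depthScaleFactor, imuParams, and cam2IMU variables are assumed to exist, as in the second example below; the threshold and fraction values are illustrative.

    % Visual-inertial construction sketch: with these illustrative values, the
    % camera-IMU alignment uses the most recent round(20*0.8) = 16 pose estimates.
    vslam = rgbdvslam(intrinsics,depthScaleFactor,imuParams, ...
        CameraToIMUTransform=cam2IMU,NumPosesThreshold=20,AlignmentFraction=0.8);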

    Object Functions

    addFrame          Add pair of color and depth images to RGB-D visual SLAM object
    hasNewKeyFrame    Check if new key frame added in RGB-D visual SLAM object
    checkStatus       Check status of RGB-D visual SLAM object
    isDone            End-of-processing status for RGB-D visual SLAM object
    mapPoints         Build 3-D map of world points from RGB-D visual SLAM object
    poses             Absolute camera poses of RGB-D visual SLAM key frames
    plot              Plot 3-D map points and estimated camera trajectory in RGB-D visual SLAM
    reset             Reset RGB-D visual SLAM object

    Examples


    Perform RGB-D visual simultaneous localization and mapping (vSLAM) using the data from the TUM RGB-D Benchmark. You can download the data to a temporary directory using a web browser or by running this code:

    baseDownloadURL = "https://vision.in.tum.de/rgbd/dataset/freiburg3/rgbd_dataset_freiburg3_long_office_household.tgz"; 
    dataFolder = fullfile(tempdir,"tum_rgbd_dataset",filesep); 
    options = weboptions(Timeout=Inf);
    tgzFileName = dataFolder+"fr3_office.tgz";
    folderExists = exist(dataFolder,"dir");
    
    % Create a folder in a temporary directory to save the downloaded file
    if ~folderExists  
        mkdir(dataFolder) 
        disp("Downloading fr3_office.tgz (1.38 GB). This download can take a few minutes.") 
        websave(tgzFileName,baseDownloadURL,options); 
        
        % Extract contents of the downloaded file
        disp("Extracting fr3_office.tgz (1.38 GB) ...") 
        untar(tgzFileName,dataFolder); 
    end

    Create two imageDatastore objects: one to store the color images and the other to store the depth images.

    colorImageFolder = dataFolder+"rgbd_dataset_freiburg3_long_office_household/rgb/";
    depthImageFolder = dataFolder+"rgbd_dataset_freiburg3_long_office_household/depth/";
    
    imdsColor = imageDatastore(colorImageFolder);
    imdsDepth = imageDatastore(depthImageFolder);

    Select the synchronized pairs of color and depth images.

    data = load("rgbDepthPairs.mat");
    imdsColor = subset(imdsColor, data.indexPairs(:, 1));
    imdsDepth = subset(imdsDepth, data.indexPairs(:, 2));

    Specify your camera intrinsic parameters, and use them to create an RGB-D visual SLAM object.

    intrinsics = cameraIntrinsics([535.4 539.2],[320.1 247.6],[480 640]);
    depthScaleFactor = 5000;
    vslam = rgbdvslam(intrinsics,depthScaleFactor);

    Process each pair of color and depth images, and visualize the camera poses and 3-D map points.

    for i = 1:numel(imdsColor.Files)
        colorImage = readimage(imdsColor,i);
        depthImage = readimage(imdsDepth,i);
        addFrame(vslam,colorImage,depthImage);
    
        if hasNewKeyFrame(vslam)
            % Query 3-D map points and camera poses
            xyzPoints = mapPoints(vslam);
            [camPoses,viewIds] = poses(vslam);
    
            % Display 3-D map points and camera trajectory
            plot(vslam);
        end
    
        % Get current status of system
        status = checkStatus(vslam);
        
        % Stop adding frames when tracking is lost
        if status == uint8(0)
            break
        end
    end 

    Figure: 3-D map points and estimated camera trajectory.

    Continue to plot the results until all added frames have been processed, and then reset the system.

    while ~isDone(vslam)
        plot(vslam);
    end

    Figure: 3-D map points and estimated camera trajectory.

    reset(vslam);

    Perform RGB-D visual-inertial SLAM using the data from the OpenLORIS-Scene Dataset. Download the data to a temporary directory using a web browser or by running this code:

    dataFolder  = fullfile(tempdir,"OpenLORIS-Scene",filesep); 
    downloadURL = "https://ssd.mathworks.com/supportfiles/shared_nav_vision/data/OpenLORIS-Scene_corridor1-4.zip";
    zipFileName = dataFolder+"corridor1-4.zip";
    
    if ~isfolder(dataFolder)
        mkdir(dataFolder);
        disp("Downloading corridor1-4.zip (1.13 GB). This download can take a few minutes.");
        options = weboptions(Timeout=Inf);
        websave(zipFileName, downloadURL, options); 
        unzip(zipFileName, dataFolder);
    end

    Create two imageDatastore objects: one to store the color images and the other to store the depth images.

    imageFolder = fullfile(dataFolder,"OpenLORIS-Scene_corridor1-4");
    imdsColor = imageDatastore(fullfile(imageFolder,"color"));
    imdsDepth = imageDatastore(fullfile(imageFolder,"aligned_depth"));

    Load the IMU measurements data and the camera-to-IMU transform.

    data    = load("corridor4_IMU_data.mat");
    gyro    = data.gyroDataCell;
    accel   = data.accelDataCell;
    cam2IMU = data.cam2IMU;

    Specify the camera intrinsics and the IMU parameters, and use them to create an RGB-D visual-inertial SLAM object.

    % Camera intrinsic and IMU parameters can be found in the downloaded  
    % sensors.yaml file
    intrinsics = cameraIntrinsics([6.1145098876953125e+02, 6.1148571777343750e+02],...
        [4.3320397949218750e+02, 2.4947302246093750e+02], [480, 848]);
    
    imuParams = factorIMUParameters(AccelerometerBiasNoise=2.499999936844688e-05*eye(3),...
           AccelerometerNoise=0.00026780980988405645*eye(3),...
           GyroscopeNoise=1.0296060281689279e-05*eye(3),...
           GyroscopeBiasNoise=2.499999993688107e-07*eye(3),...
           SampleRate=250);
    
    depthScaleFactor = 1000;
    vslam = rgbdvslam(intrinsics, depthScaleFactor, imuParams, SkipMaxFrames=10, ...
        CameraToIMUTransform=cam2IMU, TrackFeatureRange=[30, 150], DepthRange=[0.1, 6.5], ...
        NumPosesThreshold=20, MaxNumPoints=1.2e3);

    Process image data and IMU data, and visualize the camera poses and 3-D map points.

    for i = 1:numel(imdsColor.Files)
        colorImage  = readimage(imdsColor,i);
        depthImage  = readimage(imdsDepth,i);
        addFrame(vslam, colorImage, depthImage, gyro{i}, accel{i});
    
        if hasNewKeyFrame(vslam)
            plot(vslam);
        end
    end

    Continue to plot the results until all frames have been processed, set a top-down view of the map, and then reset the system.

    while ~isDone(vslam)
        if hasNewKeyFrame(vslam)
            ax = plot(vslam);
        end
    end
    view(ax, 0, 90)

    Figure: 3-D map points and estimated camera trajectory.

    reset(vslam);

    References

    [1] Mur-Artal, Raul, J. M. M. Montiel, and Juan D. Tardos. “ORB-SLAM: A Versatile and Accurate Monocular SLAM System.” IEEE Transactions on Robotics 31, no. 5 (October 2015): 1147–63. https://doi.org/10.1109/TRO.2015.2463671.


    Version History

    Introduced in R2025a

