This document describes the coordinate systems of the XREAL glasses as used in the NRSDK for Unity. It also describes the interfaces for getting the extrinsics between the glasses' components, camera image data, and camera intrinsics, and explains how to convert to other coordinate system conventions. Note that this document applies to the NRSDK for Unity only, and does not apply to other types of NRSDK.
Unity-based Coordinate Systems
In the NRSDK for Unity, the coordinate systems and their corresponding extrinsics follow the Unity coordinate system convention (left-handed).
XREAL Glass Components and Their Unity-based Coordinate Systems
The XREAL glasses consist of the following key components:
2 x Grayscale Cameras
2 x Display Cameras
Head / IMU
RGB Camera
The placement of the above components and their corresponding coordinate systems, as defined in NRSDK for Unity, are as follows
The global coordinate frame of the tracking system is as follows
Interface for Head Pose
The following interface returns the 6DoF head pose with respect to the global frame, as defined above.
// Get the pose of device in unity world coordinate.
// "NRCameraRig" transform in Unity.
Pose headpose = NRFrame.HeadPose;
Interface for Extrinsics Between Components
The interface NRFrame.GetDevicePoseFromHead returns the 6DoF extrinsics of a device's coordinate frame expressed in the Head coordinate frame. The result is a Pose, which can be converted to a transformation matrix as shown in the examples below.
For example, given a vector's coordinates $P_d$ in the device's coordinate frame and the extrinsic transformation matrix ${}^{h}T_{d}$ obtained as above, we can compute the vector's coordinates $P_h$ in the Head coordinate frame as $P_h = {}^{h}T_{d} \, P_d$.
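To make this product explicit, the extrinsic transformation can be written in block form with a 3x3 rotation $R$ and a translation $t$, acting on homogeneous coordinates. The symbols $R$ and $t$ are introduced here for illustration only.

```latex
{}^{h}T_{d} =
\begin{bmatrix} R & t \\ 0^{\top} & 1 \end{bmatrix},
\qquad
\begin{bmatrix} P_h \\ 1 \end{bmatrix}
= {}^{h}T_{d} \begin{bmatrix} P_d \\ 1 \end{bmatrix}
\quad\Longrightarrow\quad
P_h = R\,P_d + t
```

For a matrix built with Matrix4x4.TRS and unit scale, this is the operation that Matrix4x4.MultiplyPoint performs in the examples below.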
Example 1: Getting the Extrinsics of RGB Camera From Head
The following example code gets the extrinsic transformation of the RGB camera in the Head frame, and transforms a point's coordinates from the RGB camera frame to the Head frame.
// Get Pose RGBCamera From Head
Pose camPos = NRFrame.GetDevicePoseFromHead(NativeDevice.RGB_CAMERA);
// Translate Pose to Matrix4x4.
Matrix4x4 Head_T_cam = Matrix4x4.TRS(camPos.position, camPos.rotation, Vector3.one);
// Transform a vector from camera space to head space
Vector3 pInCam = new Vector3(1, 0, 0);
Vector3 pInHead = Head_T_cam.MultiplyPoint(pInCam);
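The head pose and a device extrinsic can also be chained to express a device's frame in the global frame. The following is a minimal sketch, assuming both poses follow the Unity conventions described above. The class PoseChaining and the method WorldFromDevice are hypothetical names used for illustration and are not part of NRSDK.

```csharp
using UnityEngine;

// Hypothetical helper, not part of NRSDK.
public static class PoseChaining
{
    // Composes world_T_device = world_T_head * head_T_device.
    // headInWorld:  pose of the head in the global frame (e.g. NRFrame.HeadPose).
    // deviceInHead: pose of a device in the head frame
    //               (e.g. NRFrame.GetDevicePoseFromHead(...)).
    public static Matrix4x4 WorldFromDevice(Pose headInWorld, Pose deviceInHead)
    {
        Matrix4x4 world_T_head =
            Matrix4x4.TRS(headInWorld.position, headInWorld.rotation, Vector3.one);
        Matrix4x4 head_T_device =
            Matrix4x4.TRS(deviceInHead.position, deviceInHead.rotation, Vector3.one);
        return world_T_head * head_T_device;
    }
}
```

For instance, PoseChaining.WorldFromDevice(NRFrame.HeadPose, NRFrame.GetDevicePoseFromHead(NativeDevice.RGB_CAMERA)) would give the RGB camera's frame expressed in the global frame, under the assumptions stated above.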
Converting to OpenCV-based Coordinate Systems
For computer vision algorithm developers, it is often convenient to handle quantities expressed in the OpenCV coordinate system (right-handed). The rest of this document describes how to convert the aforementioned Unity coordinate systems and their corresponding extrinsics to the OpenCV convention. It also describes the definitions and interfaces for image data and camera intrinsics.
XREAL Glass Components and Their OpenCV-based Coordinate Systems
In the OpenCV convention, the XREAL glasses' components and their corresponding coordinate systems are as follows
Converting Extrinsics: From Unity to OpenCV
The definition difference between Unity and OpenCV coordinate systems for a camera is as follows
Note that the two conventions differ only in the direction of the camera's y-axis (up in Unity, down in OpenCV), so only the y-axis needs to be negated. Therefore, given an extrinsic transformation defined under the Unity coordinate systems, we can obtain the equivalent transformation defined under the OpenCV coordinate systems with the utility function UnityToCVMatrix, which is used in the examples below.
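The listing of the utility function is not reproduced here. As a minimal sketch of what such a function could look like, assuming that both the source and the target frame of the transformation have their y-axis negated, the conversion is a change of basis with S = diag(1, -1, 1, 1). The class name CoordinateConversion is hypothetical, and this is not necessarily how NRSDK implements UnityToCVMatrix.

```csharp
using UnityEngine;

// Hypothetical container class; a sketch, not the NRSDK implementation.
public static class CoordinateConversion
{
    // Converts a transformation a_T_b defined under the Unity convention
    // into the equivalent transformation under the OpenCV convention.
    // With p_cv = S * p_unity and S = diag(1, -1, 1, 1):
    //   cv_a_T_b = S * unity_a_T_b * S   (S is its own inverse)
    public static Matrix4x4 UnityToCVMatrix(Matrix4x4 unityMat)
    {
        Matrix4x4 flipY = Matrix4x4.Scale(new Vector3(1f, -1f, 1f));
        return flipY * unityMat * flipY;
    }
}
```

Because S is its own inverse, applying the same function to an OpenCV-convention matrix converts it back to the Unity convention.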
Example 2: Converting the Extrinsics of RGB Camera From Head to OpenCV
The following example code first gets the extrinsic transformation of RGB Camera in Head, under the Unity coordinate systems as described earlier, and then converts it to the OpenCV coordinate systems by using the above utility function.
// Get Pose RGBCamera From Head
Pose camInHead = NRFrame.GetDevicePoseFromHead(NativeDevice.RGB_CAMERA);
// Translate Pose to Matrix4x4.
Matrix4x4 unityHead_T_unitycam = Matrix4x4.TRS(camInHead.position, camInHead.rotation, Vector3.one);
// Convert from Unity to OpenCV
Matrix4x4 cvHead_T_cvcam = UnityToCVMatrix(unityHead_T_unitycam);
Example 3: Getting the Extrinsics of Right Grayscale Camera From Left Grayscale Camera in OpenCV
The following example code shows how to get the extrinsic transformation between the two Grayscale cameras and convert it to the OpenCV coordinate systems.
// Get Extrinsic Left Grayscale Camera From Head
Pose lCamPos = NRFrame.GetDevicePoseFromHead(NativeDevice.LEFT_GRAYSCALE_CAMERA);
Matrix4x4 Head_T_Lcam = Matrix4x4.TRS(lCamPos.position, lCamPos.rotation, Vector3.one);
// Get Extrinsic Right Grayscale Camera From Head
Pose rCamPos = NRFrame.GetDevicePoseFromHead(NativeDevice.RIGHT_GRAYSCALE_CAMERA);
Matrix4x4 Head_T_Rcam = Matrix4x4.TRS(rCamPos.position, rCamPos.rotation, Vector3.one);
// Calculate Extrinsic Right Camera From Left
Matrix4x4 unityLcam_T_unityRcam = Head_T_Lcam.inverse * Head_T_Rcam;
// Convert Unity Extrinsic to CV
Matrix4x4 cvLcam_T_cvRcam = UnityToCVMatrix(unityLcam_T_unityRcam);
// Transform a vector from right camera to left camera
Vector3 pInRCam = new Vector3(1, 0, 0);
Vector3 pInLCam = cvLcam_T_cvRcam.MultiplyPoint(pInRCam);
Image Pixel Coordinate System and Camera Intrinsics in OpenCV
The definition of the image pixel coordinates and the camera intrinsics in the NRSDK follows the OpenCV convention.
The image data is stored row-wise in memory as follows
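As a minimal sketch of what row-wise storage implies for indexing, the byte offset of a pixel can be computed from its column u and row v. The sketch assumes that rows are tightly packed (no padding between rows), that v counts rows in the order they are stored in the buffer, and that every pixel occupies the same number of bytes. The class PixelIndexing and its method are hypothetical names, not part of NRSDK.

```csharp
// Hypothetical helper, not part of NRSDK.
public static class PixelIndexing
{
    // Returns the byte offset of pixel (u, v) in a row-wise image buffer.
    // u: column index, v: row index, width: image width in pixels,
    // bytesPerPixel: depends on the image format (an assumption to verify).
    public static int PixelOffset(int u, int v, int width, int bytesPerPixel)
    {
        return (v * width + u) * bytesPerPixel;
    }
}
```

For a single-channel 8-bit image, bytesPerPixel is 1, so the pixel at column 3 of row 2 in a 640-pixel-wide image sits at offset 2 * 640 + 3 = 1283.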
The camera intrinsic matrix $K$ is composed of the focal lengths $f_x$ and $f_y$, and the principal point $(c_x, c_y)$, expressed in pixel units.
$$K = \begin{bmatrix} f_x & 0 & c_x \\ 0 & f_y & c_y \\ 0 & 0 & 1 \end{bmatrix}$$
The distortion parameters contain the radial coefficients $k_1, k_2, k_3, k_4, k_5$ and the tangential coefficients $p_1, p_2$. The order of NRDistortionParams is $(k_1, k_2, p_1, p_2, k_3, k_4, k_5)$.
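As a reminder of how these parameters are used under the OpenCV pinhole model, a point $(X, Y, Z)$ with $Z > 0$, expressed in the camera's OpenCV coordinate frame, projects to pixel coordinates $(u, v)$ as sketched below. The sketch ignores lens distortion, and the symbols $X, Y, Z, u, v$ are introduced here for illustration.

```latex
\begin{bmatrix} u \\ v \\ 1 \end{bmatrix}
= \frac{1}{Z} \, K \begin{bmatrix} X \\ Y \\ Z \end{bmatrix}
\quad\Longrightarrow\quad
u = f_x \frac{X}{Z} + c_x,
\qquad
v = f_y \frac{Y}{Z} + c_y
```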
Interface for Camera Image Data
Raw image data can be obtained through NRRGBCamTexture or NRGrayCameraTexture for the RGBCamera or GrayCamera, respectively.
Example 4: Getting the RGB Camera's Image as Raw Byte Array
In the current version of NRSDK, one can use Texture2D to get the raw image data. The following example code uses GetRawTextureData to get raw data by accessing Texture2D from NRRGBCamTexture. The output raw data array stores the image pixel data row-wise as described above.
// Here are parts of the example code for using RGB camera, you can find the
// complete code in the CameraCaptureController file of NRSDK.
public class CameraCaptureController : MonoBehaviour
{
    // Save the reference for Texture2D from NRRGBCamTexture.
    Texture2D mTex2d;
    // The instance of NRRGBCamTexture.
    NRRGBCamTexture mCamTex;

    void Start()
    {
        // Create an instance of NRRGBCamTexture
        mCamTex = new NRRGBCamTexture();
        // Get Texture2D target and save it.
        mTex2d = mCamTex.GetTexture();
        mCamTex.Play();
    }

    void LateUpdate()
    {
        // Get raw data from Texture2D per frame.
        byte[] rawData = mTex2d.GetRawTextureData();
    }
}
Interface for Camera Intrinsics and Distortion
The interfaces for getting camera intrinsics, distortion parameters, and resolution are as follows:
public class NRFrame
{
    // Get the intrinsic matrix of device.
    public static NativeMat3f GetDeviceIntrinsicMatrix(NativeDevice device);
    // Get the distortion coefficients of device.
    public static NRDistortionParams GetDeviceDistortion(NativeDevice device);
    // Get the resolution of device.
    public static NativeResolution GetDeviceResolution(NativeDevice device);
}
Example 5: Getting the RGB Camera's Intrinsic Parameters
The following example code gets the RGB camera's intrinsic matrix and distortion parameters as described above.
// Get the RGB camera's intrinsic matrix
NativeMat3f mat = NRFrame.GetDeviceIntrinsicMatrix(NativeDevice.RGB_CAMERA);
// Get the RGB camera's distortion coefficients
NRDistortionParams distort = NRFrame.GetDeviceDistortion(NativeDevice.RGB_CAMERA);