GanosBase supports reading vector data stored in Alibaba Cloud Object Storage Service (OSS) buckets, self-managed MinIO buckets, and Hadoop Distributed File System (HDFS) clusters. This topic describes the path format for each storage type.
OSS paths
Use the following format to specify a vector data path in an OSS bucket:
oss://<access_id>:<secret_key>@[<endpoint>]/<bucket>/path_to/file
Parameters
| Parameter | Description |
|---|---|
| access_id | The AccessKey ID of the AccessKey pair used to access the OSS bucket. |
| secret_key | The AccessKey secret of the AccessKey pair. To get your AccessKey pair, see Obtain an AccessKey pair. |
| endpoint | (Optional) The endpoint of the OSS bucket. For available endpoints, see OSS endpoints. |
| bucket | The name of the OSS bucket. |
| path_to/file | The path to the object in the OSS bucket. |
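As an illustration of how the parameters above combine, the following minimal Python sketch assembles an OSS path string. The helper name `build_oss_path`, the bucket and object names, and the endpoint value are hypothetical and used only for illustration. The sketch inserts every value verbatim, and it assumes that an omitted endpoint simply leaves the slot after `@` empty, which is a literal reading of the bracket notation rather than confirmed behavior.

```python
from typing import Optional


def build_oss_path(access_id: str, secret_key: str, bucket: str,
                   object_path: str, endpoint: Optional[str] = None) -> str:
    """Assemble oss://<access_id>:<secret_key>@[<endpoint>]/<bucket>/path_to/file."""
    # Assumption: when the optional endpoint is omitted, nothing is written
    # between "@" and the "/" that precedes the bucket name.
    host = endpoint or ""
    # Drop a leading slash so the bucket and object path are joined by one "/".
    return f"oss://{access_id}:{secret_key}@{host}/{bucket}/{object_path.lstrip('/')}"


# Placeholder credentials and hypothetical names, for illustration only.
path = build_oss_path(
    access_id="<access_id>",
    secret_key="<secret_key>",
    bucket="mybucket",
    object_path="vectors/roads.shp",
    endpoint="oss-cn-hangzhou-internal.aliyuncs.com",
)
print(path)
```

The example passes an internal endpoint, in line with the usage notes for OSS paths.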
Usage notes
Use the internal endpoint to make sure GanosBase can access the OSS bucket. The database cluster and the OSS bucket must be in the same region.
To use a Shapefile, upload at least the .shp, .shx, and .dbf files to the same folder in the OSS bucket.
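The Shapefile requirement can be checked before a path is handed to GanosBase. The sketch below is one possible approach, not part of GanosBase: given the object key of the .shp file and a list of object keys already present in the bucket, it reports which required companion files are missing from the same folder. The function name and the sample keys are hypothetical, and the comparison is case-sensitive by assumption.

```python
import posixpath
from typing import Iterable, Set

# The three files that must sit side by side in the same folder.
REQUIRED_EXTENSIONS = (".shp", ".shx", ".dbf")


def missing_shapefile_parts(shp_key: str, object_keys: Iterable[str]) -> Set[str]:
    """Return the required extensions that are absent next to shp_key."""
    # "data/roads.shp" -> "data/roads"; the folder stays part of the stem,
    # so companions in a different folder do not count.
    stem, _ = posixpath.splitext(shp_key)
    present = set(object_keys)
    return {ext for ext in REQUIRED_EXTENSIONS if stem + ext not in present}


# Hypothetical bucket listing: the .dbf file has not been uploaded yet.
keys = ["data/roads.shp", "data/roads.shx", "data/roads.prj"]
print(missing_shapefile_parts("data/roads.shp", keys))  # {'.dbf'}
```

An empty result means all three required files are present in the same folder.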
MinIO paths
MinIO paths are only supported in hybrid cloud environments.
Use the following format to specify a vector data path in a MinIO bucket:
mio://<id>:<key>@<host>[:<port>]/<bucket>/path_to/file
Parameters
| Parameter | Description |
|---|---|
| id | The access ID for the MinIO bucket. |
| key | The access key for the MinIO bucket. |
| host | The host address of the MinIO server. |
| port | (Optional) The port of the MinIO server. If not specified, the HTTPS port is used. |
| bucket | The name of the MinIO bucket. |
| path_to/file | The path to the file in the MinIO bucket. |
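A MinIO path can be assembled the same way. The following Python sketch is illustrative only: `build_minio_path`, the host name, the port value, and the bucket and file names are hypothetical, and all values are inserted verbatim. The port segment is written only when a port is supplied, mirroring the optional `[:<port>]` part of the format.

```python
from typing import Optional


def build_minio_path(access_id: str, key: str, host: str, bucket: str,
                     file_path: str, port: Optional[int] = None) -> str:
    """Assemble mio://<id>:<key>@<host>[:<port>]/<bucket>/path_to/file."""
    # Append ":<port>" only when a port is given; otherwise the server-side
    # default described in the parameter table applies.
    authority = host if port is None else f"{host}:{port}"
    return f"mio://{access_id}:{key}@{authority}/{bucket}/{file_path.lstrip('/')}"


# Placeholder credentials and a hypothetical internal host, for illustration only.
with_port = build_minio_path("<id>", "<key>", "minio.internal.example",
                             "mybucket", "vectors/roads.shp", port=9000)
without_port = build_minio_path("<id>", "<key>", "minio.internal.example",
                                "mybucket", "vectors/roads.shp")
print(with_port)
print(without_port)
```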
Usage notes
To achieve high data transfer performance, the database cluster and the MinIO bucket must be in the same region and communicate over an internal network.
HDFS paths
HDFS paths are only supported in hybrid cloud environments.
Use the following format to specify a vector data path in an HDFS cluster:
HDFS://<user_name>@<host>[:<port>]/path_to/file
Parameters
| Parameter | Description |
|---|---|
| user_name | (Optional) The HDFS username. Defaults to hadoop if not specified. |
| host | The host address of the Remote Procedure Call (RPC) endpoint, as defined in the HDFS configuration file. |
| port | (Optional) The RPC port, as defined in the HDFS configuration file. |
| path_to/file | The path to the file in HDFS. |
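The sketch below shows one way to assemble an HDFS path from these parameters in Python. The helper name, host, port value, and file path are hypothetical. Because the format line does not show how the path looks when the username is left out, the sketch always writes a username and falls back to `hadoop`, the documented default, instead of omitting it.

```python
from typing import Optional


def build_hdfs_path(host: str, file_path: str, port: Optional[int] = None,
                    user_name: str = "hadoop") -> str:
    """Assemble HDFS://<user_name>@<host>[:<port>]/path_to/file."""
    # Write the username explicitly; "hadoop" matches the documented default.
    # Append ":<port>" only when an RPC port is supplied.
    authority = host if port is None else f"{host}:{port}"
    return f"HDFS://{user_name}@{authority}/{file_path.lstrip('/')}"


# Hypothetical NameNode host and RPC port, for illustration only.
default_user = build_hdfs_path("namenode.internal.example",
                               "warehouse/vectors/roads.shp", port=8020)
custom_user = build_hdfs_path("namenode.internal.example",
                              "warehouse/vectors/roads.shp", user_name="gis")
print(default_user)
print(custom_user)
```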
Usage notes
To achieve high data transfer performance, the database cluster and the HDFS cluster must be in the same Virtual Private Cloud (VPC) and communicate over an internal network.