Self-managed data source: Hive

Learn how to create a Hive data source.

Prerequisites

  • Ensure you have network connectivity:

    • If you connect Quick BI to your Hive database over the public network, add the IP addresses of Quick BI to the database whitelist. For more information, see Add a security group rule.

    • If you connect Quick BI to your Hive database over a private network, you must set up a bastion host and use an SSH tunnel to access the database.

  • You have created a Hive database.

  • You have the username and password for the Hive database.

Limitations

Quick BI supports Apache Hive versions 2.0.1 to 2.3.3.
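
To check a version string against this range, a small comparison such as the following sketch may be useful. It assumes the version is a plain dotted number such as "2.3.3". Vendor suffixes (for example, a distribution tag appended to the version) are not handled. The function and constant names are illustrative only.

```python
from typing import Tuple

# Supported range stated in this topic, both ends inclusive.
MIN_SUPPORTED = (2, 0, 1)
MAX_SUPPORTED = (2, 3, 3)


def parse_version(version: str) -> Tuple[int, ...]:
    """Convert a dotted version string such as '2.3.3' into a tuple of ints."""
    return tuple(int(part) for part in version.strip().split("."))


def is_supported_hive_version(version: str) -> bool:
    """Return True if the version falls within the supported range."""
    return MIN_SUPPORTED <= parse_version(version) <= MAX_SUPPORTED
```

Tuples compare element by element, so "2.1.0" falls inside the range while "2.3.4" and "3.1.2" fall outside it.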

Procedure

  1. Log on to the Quick BI console.

  2. Follow these steps to add a data source.

    1. Follow the instructions in Create a data source to navigate to the data source creation page.

    2. On the Self-managed data sources tab, select Hive.

  3. In the Configure Connection dialog box, configure the following parameters.

    Parameter

    Description

    Display name

    Enter a name for the data source. The name cannot contain special characters or leading or trailing spaces.

    Database address

    The IP address or domain name of your Hive database.

    Port

    The port number used to connect to the database.

    Database

    The name of the database to connect to.

    Username

    The username to access the database.

    Password

    The password for the specified username.

    SSH

    Select this option to connect to the data source through an SSH tunnel. If you select this option, configure the following parameters:

    • SSH host: The hostname or IP address of the bastion host.

    • SSH username: The username for the SSH connection.

    • SSH password: The password for the specified SSH username.

    • SSH port: The port number for the SSH connection. The default value is 22.

    Initialization SQL

    If you enable this option, you can specify SQL statements that run after a connection is established.

    Only SET statements are allowed. Separate multiple statements with semicolons (;).

    VPC data source

    Select this option if your database is deployed on an Alibaba Cloud ECS instance in a VPC. If you select this option, you must specify the private IP address and port of the instance in the Database address and Port parameters.

    • Purchaser's AccessKey ID and Purchaser's AccessKey Secret: The AccessKey ID and AccessKey Secret of the account that owns the ECS instance. Ensure that this account has read permissions on the target instance. If the account also has write permissions, Quick BI automatically adds its own IP addresses to the whitelist. Otherwise, you must add them manually. For more information about granting permissions, see Create a custom policy.

      For more information about how to obtain an AccessKey pair, see Obtain an AccessKey pair.

    • Instance ID: The ID of the ECS instance. You can find the instance ID on the Instances page in the ECS console.

      For more information, see View instance information.

    • Region: The region where the ECS instance is located. You can find the region in the upper-left corner of the ECS console.

      For more information, see View instance information.

  4. Click Test Connection to verify the data source connection.

  5. After the connection test succeeds, click OK to add the data source.
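
The Initialization SQL rule in step 3 (only SET statements, separated by semicolons) can be checked before you paste the statements into the dialog box. The following Python sketch shows one possible pre-check. It is not how Quick BI validates the field. It splits naively on semicolons, so it assumes that no value contains a semicolon. The function names are illustrative, and the Hive properties in the usage comment are only examples.

```python
from typing import List


def split_statements(init_sql: str) -> List[str]:
    """Split the text on semicolons and drop empty fragments."""
    return [part.strip() for part in init_sql.split(";") if part.strip()]


def non_set_statements(init_sql: str) -> List[str]:
    """Return the statements whose first keyword is not SET (case-insensitive)."""
    return [
        stmt
        for stmt in split_statements(init_sql)
        if stmt.split()[0].upper() != "SET"
    ]


# Example usage: an empty result means every statement starts with SET.
# non_set_statements(
#     "SET hive.exec.parallel=true; SET mapreduce.job.queuename=default"
# )
```

Any statement that the second function returns would need to be removed from the Initialization SQL field.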

Next steps

After you create the data source, you can create a dataset and analyze data.