
Step 1: Preparing an ECS as the GDS Server

Updated at: Jun 24, 2019 GMT+08:00

The following parts are included in this section:

  • Preparing an ECS as the GDS Server

    Before using GDS to import and export data, prepare one or more Linux ECSs in the same VPC as the data warehouse cluster to install the GDS tool package. These ECSs serve as the GDS servers.

    • Importing data

      Before importing data, upload the source data files to the GDS servers. In this role, the GDS servers are also called data servers.

      If the source data is large and its files are stored on multiple servers, install, configure, and start GDS on each data server.

    • Exporting data

      Data is exported from DWS as data files to GDS servers.

  • Downloading the GDS Tool Package and SSL Certificate

    Before installing GDS, log in to the DWS management console and download the GDS tool package and SSL certificate that match the cluster version.

Preparing an ECS as the GDS Server

For details about how to purchase a Linux ECS, see Purchasing and Logging In to a Linux ECS in the Elastic Cloud Server Getting Started.

The ECS that functions as the GDS server must meet the following requirements:

  • The GDS server must use an OS supported by the GDS tool package. For details, see Downloading the GDS Tool Package and SSL Certificate.
  • The GDS server can communicate with the data warehouse cluster.
    • The data warehouse cluster connects to the GDS server using the private network address, and they must belong to the same region, VPC, and subnet.

      Because the data import rate depends on network bandwidth, connecting over the private network address is recommended.

    • The GDS server can properly receive network access requests from the data warehouse cluster.

      Port: Plan a listening port for GDS. The data warehouse cluster uses this port to connect to the GDS server. Specify the listening port when starting GDS. If no port is specified, the default port 8098 is used.

      Firewall: If a firewall is enabled on the GDS server, add an inbound rule for the GDS listening port to allow the data warehouse cluster to connect to the GDS server. Otherwise, the connection cannot be set up.

      NOTE:

      When starting GDS, set the listening port to the port that has been opened on the firewall.

      The following example shows how to set up network access for an ECS that serves as the GDS server.

      First, add an inbound rule for the GDS listening port to the security group that the ECS belongs to.

      Table 1 Inbound rule example

      Parameter  | Example Value
      Protocol   | TCP
      Port Range | 5000 (NOTE: Enter the listening port of the GDS server.)
      Source     | Select IP Address and enter the IP address of the data warehouse cluster. Example value: 192.168.0.10/32.

      If a firewall is enabled on the ECS, also make sure that the GDS listening port is open on that firewall. For example, with iptables, replace <gds_port> with the GDS listening port:

      iptables -I INPUT -p tcp -m tcp --dport <gds_port> -j ACCEPT
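The port and firewall rules above can be combined into a small shell sketch. It is only an illustration: the helper names gds_port_or_default and gds_iptables_rule are hypothetical and not part of the GDS tool package. The sketch falls back to the default port 8098 when no port is given, rejects values outside the TCP port range, and prints the iptables command instead of running it, so that it can be reviewed before being applied as root.

```shell
#!/bin/sh
# Hypothetical helpers for illustration only; not shipped with GDS.

# Print the planned GDS listening port, using the documented default 8098
# when no argument is supplied. Fails for non-numeric or out-of-range values.
gds_port_or_default() {
    gds_port="${1:-8098}"
    case "$gds_port" in
        ''|*[!0-9]*) echo "invalid port: $gds_port" >&2; return 1 ;;
    esac
    # Length check first, so very long digit strings never reach the numeric test.
    if [ "${#gds_port}" -gt 5 ] || [ "$gds_port" -lt 1 ] || [ "$gds_port" -gt 65535 ]; then
        echo "invalid port: $gds_port" >&2
        return 1
    fi
    echo "$gds_port"
}

# Print (do not execute) the inbound rule from this section for the given port.
gds_iptables_rule() {
    rule_port="$(gds_port_or_default "$1")" || return 1
    echo "iptables -I INPUT -p tcp -m tcp --dport ${rule_port} -j ACCEPT"
}

# Example: the port used in Table 1.
gds_iptables_rule 5000
# prints: iptables -I INPUT -p tcp -m tcp --dport 5000 -j ACCEPT
```

Printing the rule rather than executing it keeps the sketch safe to run on any host. Whichever port is chosen here must be the same port opened in the security group and passed to GDS at startup.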

Downloading the GDS Tool Package and SSL Certificate

  1. Log in to the DWS management console.
  2. In the navigation tree on the left, click Connection Management.
  3. Select the DWS client of the corresponding version from the drop-down list. The client tool package contains the GDS tool package and the gsql client tool.

    Choose RedHat x64 or SUSE x64 according to the OS of the server on which the client will be installed.

    • The RedHat x64 client can be used on the following OSs:
      • RHEL 6.4/6.5/6.6/6.7/7.1/7.2
      • CentOS 6.4/6.5/6.6/6.7
      • EulerOS 2.0 SP2
    • The SUSE x64 client can be used on the following OSs:
      • SLES 11 SP1/SP2/SP3/SP4

  4. Click Download.
  5. (Optional) To start the GDS service in SSL encryption mode, download the SSL certificate in the Download Client and Driver area.

    If GDS is started in SSL encryption mode, data transmitted between the GDS server and the data warehouse cluster is encrypted to ensure data security.

    NOTE:

    The SSL mode is more secure than other modes. You are advised to start GDS in SSL mode.
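The OS lists in step 3 can be expressed as a simple lookup. The following is a minimal sketch, not an official tool: select_gds_package is a hypothetical helper name, and the OS identifiers (rhel, centos, euleros, sles) are assumed to match the ID field that /etc/os-release provides on hosts that ship that file. Older releases in the lists may not include /etc/os-release, in which case the identifier has to be supplied manually. The sketch does not check the OS version, so the version lists in step 3 still need to be verified separately.

```shell
#!/bin/sh
# Hypothetical helper for illustration only.
# Maps an OS identifier to the client package named in step 3.
select_gds_package() {
    case "$1" in
        rhel|centos|euleros) echo "RedHat x64" ;;
        sles)                echo "SUSE x64" ;;
        *) echo "unsupported OS: $1" >&2; return 1 ;;
    esac
}

# Example: a CentOS server uses the RedHat x64 package.
select_gds_package centos
# prints: RedHat x64
```

On a host that does provide /etc/os-release, the identifier could be read with `. /etc/os-release` and passed as `"$ID"`.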
