This is the multi-page printable view of this section. Click here to print.

Return to the regular view of this page.

Overview

    Service Overview

    Quick Query is an interactive query service that allows you to analyze large amounts of data quickly and easily using standard SQL. It is automatically installed on a standard Kubernetes cluster and provides easy and fast access to various data sources such as Cloud Hadoop, Object Storage, and RDB, enabling data retrieval and processing.

    Key Features

    • Easy and Fast Data Retrieval: After defining a schema for data stored in Object Storage, you can easily and quickly retrieve data using standard SQL. Any user who can handle SQL can easily analyze large datasets without being a professional analyst.
    • Rapid Parallel Distributed Processing: Using the Trino engine, which supports parallel distributed processing, queries are automatically divided and processed in parallel on multiple nodes, allowing you to quickly retrieve query results even for large amounts of data.
    • Various Service Structures: It provides a shared fixed resource mode, a shared resource expansion mode, and a personal resource expansion mode. The shared fixed resource mode supports a stable response speed for large data queries, while the shared resource expansion mode allows for more affordable use in cases of irregular usage. Additionally, the personal resource expansion mode supports each user’s independent analysis work, enabling the use of Quick Query with a structure that meets user demands.

    Service Composition Diagram

    Composition Diagram
    Figure. Quick Query Composition Diagram

    Provided Functions

    Quick Query provides the following functions:

    • Single Access Support for Various Data Sources (Supporting 11 Data Sources)
    • Automatic Storage Function for Result Data in Object Storage
    • Reuse Function for Query Results
    • Access Control Function through Ranger Integration
    • Data Usage Control Function
    CategoryTypeNote
    Cloud Hadoophive_on_cloud_hadoop
    iceberg_on_cloud_hadoop
    Using Cloud Hadoop’s Hive Metastore
    Object Storagehive_on_object_storage
    iceberg_on_object_storage
    Deploying Hive Metastore in Quick Query
    RDBpostgresql
    mariadb
    sqlserver
    oracle
    mysql
    JDBC Driver Upload required (licensed)
    TPCDStpcdsBuilt-in Data Source provided by Quick Query
    TPCHtpchBuilt-in Data Source provided by Quick Query
    Table. Supported Data Sources
    Typeselectinsertupdatedeletecreatedropalteranalyzecall
    hive_on_cloud_hadoopOOOOOOOOO
    iceberg_on_cloud_hadoopOOOOOOOOO
    hive_on_object_storageOOOOOOOOO
    iceberg_on_object_storageOOOOOOOOO
    postgresqlOOOOOO
    mariadbOOOOOO
    sqlserverOOOOOO
    greenplumOOOOOO
    oracleOOOOOO
    mysqlOOOOOO
    tpcdsO
    tpchO
    Table. Supported SQL

    Components

    Query Engine Type: Shared

    The query engine is a structure that is shared by multiple users when one is running.

    • Fixed Resource Mode (No Auto Scaling): When Auto Scaling is not used, the query engine runs with fixed resources according to the user’s selection. Since the query engine always runs with the same resources, it can guarantee consistent query performance.

      Diagram
      Figure. Fixed Resource Mode (No Auto Scaling)
    • Resource Expansion Mode (Using Auto Scaling): When Auto Scaling is used, the query engine’s worker nodes automatically scale in/out according to the processing volume. When the processing volume is low, the worker nodes decrease to one, and when the processing volume increases, the worker nodes increase. Additionally, resources can be adjusted according to the cluster size.

      Diagram
      Figure. Resource Expansion Mode (Using Auto Scaling)

    Query Engine Type: Personal

    • Resource Expansion Mode (Using Auto Scaling): The personal query engine type is a structure where the query engine runs separately for each user. Each query engine supports Auto Scale in/out and automatically stops when not used for an extended period. When used again, the query engine automatically restarts. The worker nodes decrease to one when the processing volume is low and increase when the processing volume increases. Additionally, resources can be adjusted according to the cluster size.

      Diagram
      Figure. Resource Expansion Mode (Using Auto Scaling)

    Server Type

    The server types supported by Quick Query are as follows:

    ClassificationExampleDetailed Description
    Server TypeStandardProvided server types
    • Standard: Standard specifications (vCPU, Memory) configuration commonly used
    • High Capacity: Server specifications with 24 cores or more
    Server Sizes1v2m4Provided server specifications
    • vCPU 2, Memory 4G
    Table. Quick Query Supported Server Types

    The minimum specifications required to use Quick Query are as follows:

    ClassificationDetailsCluster Size (User Input Value)Fixed Node PoolAuto-Scaling Node Pool
    SharedFixed Resource Mode (No Auto Scaling)Replica: 1
    CPU: 4 Core
    Memory: 8GB
    8 Core, 16GB * 4N/A
    SharedResource Expansion Mode (Using Auto Scaling)Small(1 Core, 4GB)8 Core, 16GB * 38 Core, 16GB * 1
    PersonalResource Expansion Mode (Using Auto Scaling)Small(1 Core, 4GB)8 Core, 16GB * 38 Core, 32GB * 2
    Table. Quick Query Minimum Specifications

    Region-Based Provisioning Status

    Quick Query is available in the following environments:

    RegionAvailability
    Korea West (kr-west1)Available
    Korea East (kr-east1)Available
    Korea South 1 (kr-south1)Not Available
    Korea South 2 (kr-south2)Not Available
    Korea South 3 (kr-south3)Not Available
    Table. Quick Query Region-Based Provisioning Status

    Preceding Services

    The following services must be configured before creating Quick Query. Please refer to the guides provided for each service to prepare them in advance.

    Service CategoryServiceDetailed Description
    NetworkingVPCA service that provides an independent virtual network in a cloud environment
    NetworkingSecurity GroupA virtual firewall that controls server traffic
    StorageFile StorageA storage that allows multiple client servers to share files through network connections
    Table. Quick Query Preceding Services