About 35 results
Open links in new tab
  1. Data Sampling | Apache Kylin

    Aug 18, 2022 · Kylin provides the data sampling function to facilitate table data analysis. With data sampling, you can collect table characteristics, such as cardinality, max value, and min value for …

  2. Kylin Tutorial | Apache Kylin

    During the table metadata synchronization process, the data sampling is turned on by default. You can view the auto-launched sample table data job in the Monitor -> Job page.

  3. Advanced Settings | Apache Kylin

    Sep 26, 2024 · kylin.cube.algorithm: it defines the Cubing algorithm that the job engine will select; Its default value is "auto", means the engine will dynamically pick an algorithm ("layer" or "inmem") by …

  4. Data Source | Apache Kylin

    Aug 11, 2022 · sampling_rows - optional integer, indicates the max number of sampling rows and the range is [10,000 - 20,000,000] . Note: if you enable need_sampling, this parameter will be required.

  5. Kylin Configuration | Apache Kylin

    Sep 26, 2024 · kylin.job.sampling-percentage: specifies the data sampling percentage. The default value is 100. Auto Merge Segments kylin.cube.is-automerge-enabled: whether to enable auto-merge. The …

  6. Job Concept and Settings | Apache Kylin

    Manually: You can trigger a table sampling job in the navigation bar Data Asset -> Data Source. Click on the Sample button or Reload button to trigger this kind of jobs.

  7. Project Settings | Apache Kylin

    Aug 15, 2022 · This queue resource is used for non-query jobs such as refreshing data, merging segments, building indexes, loading data and sampling table. The system will submit the job to the …

  8. Load data source | Apache Kylin

    Aug 18, 2022 · It also creates a uniform business semantic layer that frees users from concerns about the technical complexity and implementation of the underlying data source. Click the links below to …

  9. Kylin Configuration | Apache Kylin

    Sep 26, 2024 · Kylin detects Hadoop/Hive/HBase configurations from the environments automatically, for example the "core-site.xml", the "hbase-site.xml" and others. Besides, Kylin has its own …

  10. Use Hive as Data Source | Apache Kylin

    Data sampling is enabled at the table level by default. Kylin uses table sampling to collect statistical information of source tables, such as column cardinality and formats, so you can check the …