Table of Contents
- Data Lake Technology
- Data Lake Technology: Iceberg
- Integrating Flink with Iceberg
- 1. DataStream API operations on Iceberg
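- Writing to an Iceberg table in real time with the DataStream API
- Batch and streaming reads of an Iceberg table with the DataStream API
- Incremental streaming reads starting from a specified snapshot
- Compacting data files
- 2. SQL API operations on Iceberg
- Creating an Iceberg table and writing data with the SQL API
- Batch queries on an Iceberg table with the SQL API
- Streaming queries on an Iceberg table with the SQL API
- Incremental streaming reads from a specified snapshot with the SQL API
- 3. Reading Kafka data and writing it to an Iceberg table in real time with the SQL API
- Iceberg vs. Hudi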
Data Lake Technology
Data Lake Technology: Iceberg
Integrating Flink with Iceberg
- Flink version: 1.14.4
- Iceberg version: 0.13.2
- Maven dependencies:
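The dependency entries in this section refer to the Maven properties `${flink.version}`, `${iceberg.version}`, `${scala.version}`, and `${hadoop.version}`, which must be defined in the POM. A minimal sketch of such a `<properties>` block follows. The Flink and Iceberg values come from the versions stated above, and `2.12` is inferred from the hard-coded `flink-clients_2.12` artifact. The Hadoop version is not stated in this article, so the value shown is only a placeholder assumption to be replaced with the version of your own cluster.

```xml
<properties>
    <!-- Versions stated in this article -->
    <flink.version>1.14.4</flink.version>
    <iceberg.version>0.13.2</iceberg.version>
    <!-- Scala binary version used as the artifact suffix; inferred from flink-clients_2.12 -->
    <scala.version>2.12</scala.version>
    <!-- Assumption: not given in the article; substitute your cluster's Hadoop version -->
    <hadoop.version>3.2.2</hadoop.version>
</properties>
```

With properties like these in place, the dependency entries are: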
<dependency>
    <groupId>org.apache.iceberg</groupId>
    <artifactId>iceberg-flink-runtime-1.14</artifactId>
    <version>${iceberg.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-java</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-java_${scala.version}</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-clients_2.12</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-streaming-scala_${scala.version}</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-kafka_${scala.version}</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-connector-base</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-csv</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-client</artifactId>
    <version>${hadoop.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-common</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-api-java</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-api-java-bridge_${scala.version}</artifactId>
    <version>${flink.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-table-planner_${scala.version}</artifactId>
    <version>${flink.version}</version>