{{/* Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to you under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. */}}
| Key | Default | Type | Description | 
|---|---|---|---|
| auto-create | false | Boolean | Whether to create underlying storage when reading and writing the table. | 
| bucket | 1 | Integer | Bucket number for file store. | 
| bucket-key | (none) | String | Specify the paimon distribution policy. Data is assigned to each bucket according to the hash value of bucket-key. If you specify multiple fields, delimiter is ','. If not specified, the primary key will be used; if there is no primary key, the full row will be used. | 
| cache-page-size | 16 kb | MemorySize | Memory page size for caching. | 
| changelog-producer | none | Enum | Whether to double write to a changelog file. This changelog file keeps the details of data changes, it can be read directly during stream reads. This can be applied to tables with primary keys. Possible values: 
 | 
| changelog-producer.row-deduplicate | false | Boolean | Whether to generate -U, +U changelog for the same record. This configuration is only valid for the changelog-producer is lookup or full-compaction. | 
| commit.callback.#.param | (none) | String | Parameter string for the constructor of class #. Callback class should parse the parameter by itself. | 
| commit.callbacks | (none) | String | A list of commit callback classes to be called after a successful commit. Class names are connected with comma (example: com.test.CallbackA,com.sample.CallbackB). | 
| commit.force-compact | false | Boolean | Whether to force a compaction before commit. | 
| compaction.max-size-amplification-percent | 200 | Integer | The size amplification is defined as the amount (in percentage) of additional storage needed to store a single byte of data in the merge tree for changelog mode table. | 
| compaction.max.file-num | 50 | Integer | For file set [f_0,...,f_N], the maximum file number to trigger a compaction for append-only table, even if sum(size(f_i)) < targetFileSize. This value avoids pending too much small files, which slows down the performance. | 
| compaction.min.file-num | 5 | Integer | For file set [f_0,...,f_N], the minimum file number which satisfies sum(size(f_i)) >= targetFileSize to trigger a compaction for append-only table. This value avoids almost-full-file to be compacted, which is not cost-effective. | 
| compaction.optimization-interval | (none) | Duration | Implying how often to perform an optimization compaction, this configuration is used to ensure the query timeliness of the read-optimized system table. | 
| compaction.size-ratio | 1 | Integer | Percentage flexibility while comparing sorted run size for changelog mode table. If the candidate sorted run(s) size is 1% smaller than the next sorted run's size, then include next sorted run into this candidate set. | 
| consumer-id | (none) | String | Consumer id for recording the offset of consumption in the storage. | 
| consumer.expiration-time | (none) | Duration | The expiration interval of consumer files. A consumer file will be expired if it's lifetime after last modification is over this value. | 
| consumer.ignore-progress | false | Boolean | Whether to ignore consumer progress for the newly started job. | 
| consumer.mode | exactly-once | Enum | Specify the consumer consistency mode for table. Possible values: 
 | 
| continuous.discovery-interval | 10 s | Duration | The discovery interval of continuous reading. | 
| cross-partition-upsert.bootstrap-parallelism | 10 | Integer | The parallelism for bootstrap in a single task for cross partition upsert. | 
| cross-partition-upsert.index-ttl | (none) | Duration | The TTL in rocksdb index for cross partition upsert (primary keys not contain all partition fields), this can avoid maintaining too many indexes and lead to worse and worse performance, but please note that this may also cause data duplication. | 
| deduplicate.ignore-delete | false | Boolean | Whether to ignore delete records in deduplicate mode. | 
| dynamic-bucket.assigner-parallelism | (none) | Integer | Parallelism of assigner operator for dynamic bucket mode, it is related to the number of initialized bucket, too small will lead to insufficient processing speed of assigner. | 
| dynamic-bucket.initial-buckets | (none) | Integer | Initial buckets for a partition in assigner operator for dynamic bucket mode. | 
| dynamic-bucket.target-row-num | 2000000 | Long | If the bucket is -1, for primary key table, is dynamic bucket mode, this option controls the target row number for one bucket. | 
| dynamic-partition-overwrite | true | Boolean | Whether only overwrite dynamic partition when overwriting a partitioned table with dynamic partition columns. Works only when the table has partition keys. | 
| file-reader-async-threshold | 10 mb | MemorySize | The threshold for read file async. | 
| file.compression | (none) | String | Default file compression format, orc is lz4 and parquet is snappy. It can be overridden by file.compression.per.level | 
| file.compression.per.level | Map | Define different compression policies for different level, you can add the conf like this: 'file.compression.per.level' = '0:lz4,1:zstd'. | |
| file.format | orc | Enum | Specify the message format of data files, currently orc, parquet and avro are supported. Possible values: 
 | 
| file.format.per.level | Map | Define different file format for different level, you can add the conf like this: 'file.format.per.level' = '0:avro,3:parquet', if the file format for level is not provided, the default format which set by `file.format` will be used. | |
| first-row.ignore-delete | false | Boolean | Whether to ignore delete records in first-row mode. | 
| full-compaction.delta-commits | (none) | Integer | Full compaction will be constantly triggered after delta commits. | 
| incremental-between | (none) | String | Read incremental changes between start snapshot (exclusive) and end snapshot, for example, '5,10' means changes between snapshot 5 and snapshot 10. | 
| incremental-between-scan-mode | delta | Enum | Scan kind when Read incremental changes between start snapshot (exclusive) and end snapshot, 'delta' for scan newly changed files between snapshots, 'changelog' scan changelog files between snapshots. Possible values: 
 | 
| incremental-between-timestamp | (none) | String | Read incremental changes between start timestamp (exclusive) and end timestamp, for example, 't1,t2' means changes between timestamp t1 and timestamp t2. | 
| local-merge-buffer-size | (none) | MemorySize | Local merge will buffer and merge input records before they're shuffled by bucket and written into sink. The buffer will be flushed when it is full. Mainly to resolve data skew on primary keys. We recommend starting with 64 mb when trying out this feature. | 
| local-sort.max-num-file-handles | 128 | Integer | The maximal fan-in for external merge sort. It limits the number of file handles. If it is too small, may cause intermediate merging. But if it is too large, it will cause too many files opened at the same time, consume memory and lead to random reading. | 
| lookup.cache-file-retention | 1 h | Duration | The cached files retention time for lookup. After the file expires, if there is a need for access, it will be re-read from the DFS to build an index on the local disk. | 
| lookup.cache-max-disk-size | 9223372036854775807 bytes | MemorySize | Max disk size for lookup cache, you can use this option to limit the use of local disks. | 
| lookup.cache-max-memory-size | 256 mb | MemorySize | Max memory size for lookup cache. | 
| lookup.cache.bloom.filter.enabled | true | Boolean | Whether to enable the bloom filter for lookup cache. | 
| lookup.cache.bloom.filter.fpp | 0.05 | Double | Define the default false positive probability for lookup cache bloom filters. | 
| lookup.hash-load-factor | 0.75 | Float | The index load factor for lookup. | 
| manifest.format | avro | Enum | Specify the message format of manifest files. Possible values: 
 | 
| manifest.full-compaction-threshold-size | 16 mb | MemorySize | The size threshold for triggering full compaction of manifest. | 
| manifest.merge-min-count | 30 | Integer | To avoid frequent manifest merges, this parameter specifies the minimum number of ManifestFileMeta to merge. | 
| manifest.target-file-size | 8 mb | MemorySize | Suggested file size of a manifest file. | 
| merge-engine | deduplicate | Enum | Specify the merge engine for table with primary key. Possible values: 
 | 
| metadata.stats-mode | "truncate(16)" | String | The mode of metadata stats collection. none, counts, truncate(16), full is available. 
 
 
 
 
 | 
| metastore.partitioned-table | false | Boolean | Whether to create this table as a partitioned table in metastore. For example, if you want to list all partitions of a Paimon table in Hive, you need to create this table as a partitioned table in Hive metastore. This config option does not affect the default filesystem metastore. | 
| metastore.tag-to-partition | (none) | String | Whether to create this table as a partitioned table for mapping non-partitioned table tags in metastore. This allows the Hive engine to view this table in a partitioned table view and use partitioning field to read specific partitions (specific tags). | 
| metastore.tag-to-partition.preview | none | Enum | Whether to preview tag of generated snapshots in metastore. This allows the Hive engine to query specific tag before creation. Possible values: 
 | 
| num-levels | (none) | Integer | Total level number, for example, there are 3 levels, including 0,1,2 levels. | 
| num-sorted-run.compaction-trigger | 5 | Integer | The sorted run number to trigger compaction. Includes level0 files (one file one sorted run) and high-level runs (one level one sorted run). | 
| num-sorted-run.stop-trigger | (none) | Integer | The number of sorted runs that trigger the stopping of writes, the default value is 'num-sorted-run.compaction-trigger' + 1. | 
| orc.bloom.filter.columns | (none) | String | A comma-separated list of columns for which to create a bloom filter when writing. | 
| orc.bloom.filter.fpp | 0.05 | Double | Define the default false positive probability for bloom filters. | 
| orc.column.encoding.direct | (none) | Integer | Comma-separated list of fields for which dictionary encoding is to be skipped in orc. | 
| orc.dictionary.key.threshold | (none) | Integer | If the number of distinct keys in a dictionary is greater than this fraction of the total number of non-null rows, turn off dictionary encoding in orc. Use 1 to always use dictionary encoding. | 
| orc.write.batch-size | 1024 | Integer | write batch size for orc. | 
| page-size | 64 kb | MemorySize | Memory page size. | 
| parquet.enable.dictionary | (none) | Integer | Turn off the dictionary encoding for all fields in parquet. | 
| partial-update.ignore-delete | false | Boolean | Whether to ignore delete records in partial-update mode. | 
| partition | (none) | String | Define partition by table options, cannot define partition on DDL and table options at the same time. | 
| partition.default-name | "__DEFAULT_PARTITION__" | String | The default partition name in case the dynamic partition column value is null/empty string. | 
| partition.expiration-check-interval | 1 h | Duration | The check interval of partition expiration. | 
| partition.expiration-time | (none) | Duration | The expiration interval of a partition. A partition will be expired if it‘s lifetime is over this value. Partition time is extracted from the partition value. | 
| partition.timestamp-formatter | (none) | String | The formatter to format timestamp from string. It can be used with 'partition.timestamp-pattern' to create a formatter using the specified value. 
 | 
| partition.timestamp-pattern | (none) | String | You can specify a pattern to get a timestamp from partitions. The formatter pattern is defined by 'partition.timestamp-formatter'. 
 | 
| primary-key | (none) | String | Define primary key by table options, cannot define primary key on DDL and table options at the same time. | 
| read.batch-size | 1024 | Integer | Read batch size for orc and parquet. | 
| rowkind.field | (none) | String | The field that generates the row kind for primary key table, the row kind determines which data is '+I', '-U', '+U' or '-D'. | 
| scan.bounded.watermark | (none) | Long | End condition "watermark" for bounded streaming mode. Stream reading will end when a larger watermark snapshot is encountered. | 
| scan.file-creation-time-millis | (none) | Long | After configuring this time, only the data files created after this time will be read. It is independent of snapshots, but it is imprecise filtering (depending on whether or not compaction occurs). | 
| scan.manifest.parallelism | (none) | Integer | The parallelism of scanning manifest files, default value is the size of cpu processor. Note: Scale-up this parameter will increase memory usage while scanning manifest files. We can consider downsize it when we encounter an out of memory exception while scanning | 
| scan.max-splits-per-task | 10 | Integer | Max split size should be cached for one task while scanning. If splits size cached in enumerator are greater than tasks size multiply by this value, scanner will pause scanning. | 
| scan.mode | default | Enum | Specify the scanning behavior of the source. Possible values: 
 | 
| scan.plan-sort-partition | false | Boolean | Whether to sort plan files by partition fields, this allows you to read according to the partition order, even if your partition writes are out of order. It is recommended that you use this for streaming read of the 'append-only' table. By default, streaming read will read the full snapshot first. In order to avoid the disorder reading for partitions, you can open this option. | 
| scan.snapshot-id | (none) | Long | Optional snapshot id used in case of "from-snapshot" or "from-snapshot-full" scan mode | 
| scan.tag-name | (none) | String | Optional tag name used in case of "from-snapshot" scan mode. | 
| scan.timestamp-millis | (none) | Long | Optional timestamp used in case of "from-timestamp" scan mode. If there is no snapshot earlier than this time, the earliest snapshot will be chosen. | 
| sequence.auto-padding | (none) | String | Specify the way of padding precision, if the provided sequence field is used to indicate "time" but doesn't meet the precise. 
 | 
| sequence.field | (none) | String | The field that generates the sequence number for primary key table, the sequence number determines which data is the most recent. | 
| sink.watermark-time-zone | "UTC" | String | The time zone to parse the long watermark value to TIMESTAMP value. The default value is 'UTC', which means the watermark is defined on TIMESTAMP column or not defined. If the watermark is defined on TIMESTAMP_LTZ column, the time zone of watermark is user configured time zone, the value should be the user configured local time zone. The option value is either a full name such as 'America/Los_Angeles', or a custom timezone id such as 'GMT-08:00'. | 
| snapshot.expire.clean-empty-directories | true | Boolean | Whether to try to clean empty directories when expiring snapshots. Note that trying to clean directories might throw exceptions in filesystem, but in most cases it won't cause problems. | 
| snapshot.expire.execution-mode | sync | Enum | Specifies the execution mode of expire. Possible values: 
 | 
| snapshot.expire.limit | 10 | Integer | The maximum number of snapshots allowed to expire at a time. | 
| snapshot.num-retained.max | 2147483647 | Integer | The maximum number of completed snapshots to retain. Should be greater than or equal to the minimum number. | 
| snapshot.num-retained.min | 10 | Integer | The minimum number of completed snapshots to retain. Should be greater than or equal to 1. | 
| snapshot.time-retained | 1 h | Duration | The maximum time of completed snapshots to retain. | 
| sort-engine | loser-tree | Enum | Specify the sort engine for table with primary key. Possible values: 
 | 
| sort-spill-buffer-size | 64 mb | MemorySize | Amount of data to spill records to disk in spilled sort. | 
| sort-spill-threshold | (none) | Integer | If the maximum number of sort readers exceeds this value, a spill will be attempted. This prevents too many readers from consuming too much memory and causing OOM. | 
| source.split.open-file-cost | 4 mb | MemorySize | Open file cost of a source file. It is used to avoid reading too many files with a source split, which can be very slow. | 
| source.split.target-size | 128 mb | MemorySize | Target size of a source split when scanning a bucket. | 
| streaming-read-mode | (none) | Enum | The mode of streaming read that specifies to read the data of table file or log Possible values: 
 
 Possible values: 
 | 
| streaming-read-overwrite | false | Boolean | Whether to read the changes from overwrite in streaming mode. Cannot be set to true when changelog producer is full-compaction or lookup because it will read duplicated changes. | 
| tag.automatic-creation | none | Enum | Whether to create tag automatically. And how to generate tags. Possible values: 
 | 
| tag.callback.#.param | (none) | String | Parameter string for the constructor of class #. Callback class should parse the parameter by itself. | 
| tag.callbacks | (none) | String | A list of commit callback classes to be called after a successful tag. Class names are connected with comma (example: com.test.CallbackA,com.sample.CallbackB). | 
| tag.creation-delay | 0 ms | Duration | How long is the delay after the period ends before creating a tag. This can allow some late data to enter the Tag. | 
| tag.creation-period | daily | Enum | What frequency is used to generate tags. Possible values: 
 | 
| tag.num-retained-max | (none) | Integer | The maximum number of tags to retain. | 
| snapshot.watermark-idle-timeout | (none) | Duration | In watermarking, if a source remains idle beyond the specified timeout duration, it triggers snapshot advancement and facilitates tag creation. | 
| tag.period-formatter | with_dashes | Enum | The date format for tag periods. Possible values: 
 | 
| target-file-size | 128 mb | MemorySize | Target size of a file. | 
| write-buffer-for-append | false | Boolean | This option only works for append-only table. Whether the write use write buffer to avoid out-of-memory error. | 
| write-buffer-size | 256 mb | MemorySize | Amount of data to build up in memory before converting to a sorted on-disk file. | 
| write-buffer-spillable | (none) | Boolean | Whether the write buffer can be spillable. Enabled by default when using object storage. | 
| write-manifest-cache | 0 bytes | MemorySize | Cache size for reading manifest files for write initialization. | 
| write-max-writers-to-spill | 5 | Integer | When in batch append inserting, if the writer number is greater than this option, we open the buffer cache and spill function to avoid out-of-memory. | 
| write-only | false | Boolean | If set to true, compactions and snapshot expiration will be skipped. This option is used along with dedicated compact jobs. | 
| zorder.var-length-contribution | 8 | Integer | The bytes of types (CHAR, VARCHAR, BINARY, VARBINARY) devote to the zorder sort. |