Data Ingestion and Replication
Option
| Description
|
---|---|
Apply Cycle Interval
| Specifies the amount of time that must elapse before a database ingestion job ends an apply cycle. You can specify days, hours, minutes, and seconds, or enter values for only some of these time fields and leave the others blank.
The default value is 15 minutes.
|
Apply Cycle Change Limit
| Specifies the number of records that must be processed before a database ingestion job ends an apply cycle. When this record limit is reached, the database ingestion job ends the apply cycle and writes the change data to the target.
The default value is 10000 records.
|
Low Activity Flush Interval
| Specifies the amount of time, in hours, minutes, or both, that must elapse during a period of no change activity on the source before a database ingestion job ends an apply cycle. When this time limit is reached, the database ingestion job ends the apply cycle and writes the change data to the target.
If you do not specify a value for this option, a database ingestion job ends apply cycles only after either the Apply Cycle Change Limit or Apply Cycle Interval limit is reached.
No default value is provided.
|
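The way the three apply cycle limits interact can be sketched in a few lines of Python. This is only an illustration of the rules described in the table above, not the product's implementation: the class name, field names, function name, and the order in which the limits are checked are all assumptions.

```python
from dataclasses import dataclass
from datetime import timedelta
from typing import Optional


@dataclass
class ApplyCycleSettings:
    # Hypothetical field names; default values are the ones documented above.
    interval: timedelta = timedelta(minutes=15)               # Apply Cycle Interval
    change_limit: int = 10_000                                # Apply Cycle Change Limit
    low_activity_flush_interval: Optional[timedelta] = None   # no default value


def limit_reached(settings: ApplyCycleSettings,
                  elapsed: timedelta,
                  records_processed: int,
                  idle_time: timedelta) -> Optional[str]:
    """Return the name of the first limit that has been reached, or None.

    The check order is an assumption; the source only states that reaching
    a limit ends the apply cycle.
    """
    if records_processed >= settings.change_limit:
        return "Apply Cycle Change Limit"
    if elapsed >= settings.interval:
        return "Apply Cycle Interval"
    flush = settings.low_activity_flush_interval
    # When no flush interval is set, only the two limits above can end a cycle.
    if flush is not None and idle_time >= flush:
        return "Low Activity Flush Interval"
    return None
```

For example, with the default settings, a cycle that has run for 5 minutes and processed 10000 records ends because of the change limit, while a cycle with 10 records after 5 minutes continues.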
Option
| Description
|
---|---|
Ignore
| Does not replicate DDL changes that occur on the source database to the target. For Amazon Redshift, Kafka, Microsoft Azure Synapse Analytics, or Snowflake targets, this option is the default option for the Drop Column and Rename Column operation types.
For Amazon S3, Google Cloud Storage, and Microsoft Azure Data Lake Storage targets that use the CSV output format, the Ignore option is disabled. For the AVRO output format, this option is enabled.
|
Replicate
| Allows the database ingestion job to replicate the DDL change to the target. For Amazon S3, Google Cloud Storage, and Microsoft Azure Data Lake Storage targets, this option is the default option for all operation types. For other targets, this option is the default option for the Add Column and Modify Column operation types.
|
Stop Job
| Stops the entire database ingestion job.
|
Stop Table
| Stops processing the source table on which the DDL change occurred. When one or more tables are excluded from replication because of the Stop Table schema drift option, the job state changes to Running with Warning.
The database ingestion job cannot retrieve the data changes that occurred on the source table after the job stopped processing it. Consequently, data loss might occur on the target. To avoid data loss, you must resynchronize the source and target objects that the job stopped processing. For the option to use, see "Overriding schema drift options when resuming a database ingestion job."
|
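The default schema drift options described in the table above can be summarized as a small lookup. The following Python sketch is illustrative only: the function and constant names are hypothetical, and it returns None for any target and operation combination whose default is not stated in this section.

```python
from typing import Optional

# Target groups as named in the table above.
FILE_TARGETS = {
    "Amazon S3",
    "Google Cloud Storage",
    "Microsoft Azure Data Lake Storage",
}
IGNORE_DEFAULT_TARGETS = {
    "Amazon Redshift",
    "Kafka",
    "Microsoft Azure Synapse Analytics",
    "Snowflake",
}
OPERATIONS = {"Add Column", "Modify Column", "Drop Column", "Rename Column"}


def default_schema_drift_option(target: str, operation: str) -> Optional[str]:
    """Return the documented default option, or None if this section does not state one."""
    if operation not in OPERATIONS:
        raise ValueError(f"unknown DDL operation type: {operation}")
    if target in FILE_TARGETS:
        return "Replicate"  # default for all operation types
    if operation in {"Add Column", "Modify Column"}:
        return "Replicate"  # default for other targets
    if target in IGNORE_DEFAULT_TARGETS:
        return "Ignore"     # Drop Column and Rename Column
    return None             # not specified in this section
```

For example, `default_schema_drift_option("Snowflake", "Drop Column")` returns `"Ignore"`, and `default_schema_drift_option("Amazon S3", "Drop Column")` returns `"Replicate"`.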
Option
| Description
|
---|---|
Checkpoint All Rows
| Indicates whether a database ingestion job performs checkpoint processing for every message that is sent to the Kafka target.
If this check box is selected, the Checkpoint Every Commit, Checkpoint Row Count, and Checkpoint Frequency (secs) options are ignored.
|
Checkpoint Every Commit
| Indicates whether a database ingestion job performs checkpoint processing for every commit that occurs on the source.
|
Checkpoint Row Count
| Specifies the maximum number of messages that a database ingestion job sends to the target before adding a checkpoint. If you set this option to 0, a database ingestion job does not perform checkpoint processing based on the number of messages. If you set this option to 1, a database ingestion job adds a checkpoint for each message.
|
Checkpoint Frequency (secs)
| Specifies the maximum number of seconds that must elapse before a database ingestion job adds a checkpoint. If you set this option to 0, a database ingestion job does not perform checkpoint processing based on elapsed time.
|
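The precedence among the four checkpoint options can be sketched as a single decision function. This Python example is an assumption-laden illustration, not the product's code: the class, field, and parameter names are hypothetical, the dataclass defaults are placeholders rather than documented defaults, and the function is assumed to be called after each message is sent to the Kafka target.

```python
from dataclasses import dataclass


@dataclass
class CheckpointSettings:
    # Hypothetical names; the default values here are placeholders only.
    all_rows: bool = False        # Checkpoint All Rows
    every_commit: bool = False    # Checkpoint Every Commit
    row_count: int = 0            # Checkpoint Row Count (0 disables)
    frequency_secs: int = 0       # Checkpoint Frequency (secs) (0 disables)


def should_checkpoint(settings: CheckpointSettings,
                      messages_since_checkpoint: int,
                      seconds_since_checkpoint: float,
                      at_source_commit: bool) -> bool:
    """Decide whether to add a checkpoint after the message just sent."""
    if settings.all_rows:
        # The other three options are ignored when this one is selected.
        return True
    if settings.every_commit and at_source_commit:
        return True
    if settings.row_count > 0 and messages_since_checkpoint >= settings.row_count:
        return True
    if settings.frequency_secs > 0 and seconds_since_checkpoint >= settings.frequency_secs:
        return True
    return False
```

With `row_count=1`, the function returns True after every message, which matches the behavior described for Checkpoint Row Count. With both `row_count` and `frequency_secs` set to 0 and the two check boxes cleared, it never returns True.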