Mass Ingestion

Restart and recovery for incremental load jobs

Mass Ingestion Databases can restart incremental load and combined initial and incremental load jobs that stopped because of an error or a user stop request without losing change data.

After the first job run, Mass Ingestion Databases continually records an identifier for the processing position in the change stream as changes are applied to the target. For file-based targets such as Amazon S3, Azure Data Lake Storage, Google Cloud Storage, and Kafka, the identifier is stored in a checkpoint file. For database targets, the identifier is stored in a generated recovery table, called INFORMATICA_CDC_RECOVERY, on the target.
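As a rough illustration of the rule above, the following Python sketch maps a target type to the place where the position identifier is kept. The function name `checkpoint_store` and the set `FILE_BASED_TARGETS` are hypothetical names chosen for this example; they are not part of the product, and the sketch does not reflect how the product is implemented internally.

```python
# Hypothetical sketch: where the change-stream position is stored, by target type.
# The target names and the recovery table name come from the text above;
# everything else is an illustrative assumption.

FILE_BASED_TARGETS = {
    "Amazon S3",
    "Azure Data Lake Storage",
    "Google Cloud Storage",
    "Kafka",
}

RECOVERY_TABLE = "INFORMATICA_CDC_RECOVERY"


def checkpoint_store(target_type: str) -> str:
    """Return a description of where the position identifier is stored."""
    if target_type in FILE_BASED_TARGETS:
        return "checkpoint file"
    # Any other target is treated as a database target in this sketch.
    return f"recovery table {RECOVERY_TABLE} on the target"


if __name__ == "__main__":
    for target in ("Amazon S3", "Kafka", "Oracle"):
        print(target, "->", checkpoint_store(target))
```

Here "Oracle" stands in for any database target; the sketch assumes that every target not in the file-based set is a database target.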

For the first job run, Mass Ingestion Databases uses the start point that you set in the Restart Point for Incremental Load field when defining the database ingestion task.

If incremental job processing ends abnormally or in response to a user stop or abort request and you then resume the job, the job resumes from the last position saved to the checkpoint file or recovery table. A checkpoint will not be available unless a change record was processed for at least one of the tables during the first job run after deployment. If a checkpoint is not available, the job resumes processing from the configured restart point, which is the latest available position in the change stream by default.
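The resume behavior described above can be summarized in a short Python sketch. It is a simplified model under stated assumptions, not the product's implementation: the class `JobState` and the functions `record_checkpoint` and `resume_position` are hypothetical names, and positions are represented as plain strings.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class JobState:
    # Value set in the "Restart Point for Incremental Load" field (hypothetical representation).
    configured_restart_point: str
    # Last position saved to the checkpoint file or recovery table; None until
    # at least one change record has been processed.
    saved_checkpoint: Optional[str] = None


def record_checkpoint(state: JobState, position: str) -> None:
    """Save the processing position after a change is applied to the target."""
    state.saved_checkpoint = position


def resume_position(state: JobState) -> str:
    """Pick the position from which a stopped job resumes."""
    if state.saved_checkpoint is not None:
        # A checkpoint exists: resume from the last saved position.
        return state.saved_checkpoint
    # No checkpoint yet: fall back to the configured restart point.
    return state.configured_restart_point


if __name__ == "__main__":
    state = JobState(configured_restart_point="latest available")
    print(resume_position(state))  # no change processed yet

    record_checkpoint(state, "position-0001")
    print(resume_position(state))  # resumes from the saved checkpoint
```

The fallback branch models the case in which no change record was processed during the first run after deployment, so the job resumes from the configured restart point.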
