OPT_USE_RUN_IDS_INCREMENTALLY

OPT_USE_RUN_IDS_INCREMENTALLY is a load option that improves the performance of Agile Data Engine internal metadata queries in certain situations when using Run ID logic.

With OPT_USE_RUN_IDS_INCREMENTALLY set to true, only run ids from the previous load time onwards are processed. That is, Agile Data Engine uses the latest logged run timestamp of the load when it queries loadable run ids internally.
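
The effect of the option can be illustrated with a small Python sketch. This is a conceptual model only, not Agile Data Engine's internal implementation: the names RunIdRecord, registered_at and loadable_run_ids are hypothetical, and it assumes that each run id has a registration timestamp and that the comparison against the previous load time is inclusive.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional, Set


@dataclass(frozen=True)
class RunIdRecord:
    """Hypothetical metadata row: a run id and the time it was registered."""
    run_id: int
    registered_at: datetime


def loadable_run_ids(
    records: List[RunIdRecord],
    already_loaded: Set[int],
    previous_load_time: Optional[datetime] = None,
) -> List[int]:
    """Return the run ids that still need to be loaded.

    When previous_load_time is given (the incremental case), records
    registered before it are skipped before the already-loaded check,
    so fewer records have to be examined.
    """
    candidates = records
    if previous_load_time is not None:
        # Assumption: "from the previous load time onwards" is inclusive.
        candidates = [r for r in records if r.registered_at >= previous_load_time]
    return [r.run_id for r in candidates if r.run_id not in already_loaded]


records = [
    RunIdRecord(1, datetime(2022, 7, 11, 11, 0, 0)),
    RunIdRecord(2, datetime(2022, 7, 11, 11, 50, 0)),
    RunIdRecord(3, datetime(2022, 7, 11, 12, 0, 0)),
]
previous = datetime(2022, 7, 11, 11, 48, 30, 784000)

# Full scan: every record is checked against the already-loaded set.
print(loadable_run_ids(records, already_loaded={1}))  # [2, 3]

# Incremental: run id 1 is filtered out by timestamp before the check.
print(loadable_run_ids(records, already_loaded={1}, previous_load_time=previous))  # [2, 3]
```

In this sketch both calls return the same run ids; the incremental call simply examines fewer records, which is where a performance gain would come from when many run ids exist.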



Usage

OPT_USE_RUN_IDS_INCREMENTALLY: boolean

Default: false


Notes

OPT_USE_RUN_IDS_INCREMENTALLY is useful when a large number of run ids are produced, for example in near real-time use cases with many files and file-specific run ids.

Use with:

OPT_USE_RUN_IDS: true

Like Run ID logic in general, OPT_USE_RUN_IDS_INCREMENTALLY can be used in table loads that have a single entity mapping (i.e. one source entity).


Examples

Using OPT_USE_RUN_IDS_INCREMENTALLY

For examples of using Run ID logic, see OPT_USE_RUN_IDS.

Enable Run ID logic for a load:

OPT_USE_RUN_IDS: true

Set run ids to be processed incrementally:

OPT_USE_RUN_IDS_INCREMENTALLY: true

When the load is executed, Agile Data Engine will use the timestamp of the previous logged execution of the load when querying internal metadata for new run ids.

For example, if the load was previously run at '2022-07-11T11:48:30.784000', the load log would display:

INFO - OPT_USE_RUN_IDS_INCREMENTALLY is set, fetching RunIds from: 2022-07-11T11:48:30.784000
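
To check from which timestamp run ids were fetched, the log line can be parsed with a short script. The following Python sketch assumes the log message format shown above; the function name parse_fetch_from is hypothetical, and the message format may differ between versions.

```python
import re
from datetime import datetime
from typing import Optional

# Pattern based on the example log line above (assumed format).
LOG_PATTERN = re.compile(
    r"OPT_USE_RUN_IDS_INCREMENTALLY is set, fetching RunIds from: (\S+)"
)


def parse_fetch_from(log_line: str) -> Optional[datetime]:
    """Return the 'fetching RunIds from' timestamp, or None if the line does not match."""
    match = LOG_PATTERN.search(log_line)
    if match is None:
        return None
    # The timestamp in the example is ISO 8601 with microseconds.
    return datetime.fromisoformat(match.group(1))


line = (
    "INFO - OPT_USE_RUN_IDS_INCREMENTALLY is set, "
    "fetching RunIds from: 2022-07-11T11:48:30.784000"
)
print(parse_fetch_from(line))  # 2022-07-11 11:48:30.784000
```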