Zendesk incremental tickets API loads duplicates #659

dlt version

1.17.1

Source name

zendesk

Describe the problem

The zendesk source uses time-based incremental loading. As explained in the Zendesk docs, time-based exports can return the same ticket on two adjacent pages, so the source loads duplicates. The source should be updated either to deduplicate adjacent response pages or to use the cursor-based exports that Zendesk recommends.
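To illustrate the first option, here is a minimal sketch of deduplicating adjacent pages. It is not the source's actual code: the function name `dedupe_adjacent_pages` is hypothetical, and it assumes each page is a list of ticket dicts where the pair (`id`, `updated_at`) identifies one version of a ticket.

```python
from typing import Any, Dict, Iterable, Iterator, List, Set, Tuple

Ticket = Dict[str, Any]


def dedupe_adjacent_pages(pages: Iterable[List[Ticket]]) -> Iterator[List[Ticket]]:
    """Yield each page with tickets already seen on the previous page removed.

    Assumption: (id, updated_at) identifies one version of a ticket, so a
    repeated pair is a true duplicate and not a newer update.
    """
    previous_keys: Set[Tuple[Any, Any]] = set()
    for page in pages:
        seen_here: Set[Tuple[Any, Any]] = set()
        fresh: List[Ticket] = []
        for ticket in page:
            key = (ticket["id"], ticket["updated_at"])
            # Keep the ticket only if it was not on the previous page
            # and has not already appeared earlier on this page.
            if key not in previous_keys and key not in seen_here:
                fresh.append(ticket)
            seen_here.add(key)
        yield fresh
        # Only the immediately preceding page is remembered, which keeps
        # memory bounded by the page size.
        previous_keys = seen_here


# Hypothetical data: ticket 2 sits on the boundary between two pages.
example_pages = [
    [{"id": 1, "updated_at": "2024-01-01T00:00:00Z"},
     {"id": 2, "updated_at": "2024-01-01T00:00:05Z"}],
    [{"id": 2, "updated_at": "2024-01-01T00:00:05Z"},
     {"id": 3, "updated_at": "2024-01-01T00:00:09Z"}],
]
deduped = list(dedupe_adjacent_pages(example_pages))
```

Remembering only the previous page is enough if duplicates occur only at page boundaries, which is the case described in this issue. If duplicates can appear further apart, a wider window would be needed.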

Expected behavior

The current behavior can lead to duplicates. Data quality issues aside, this also prevents using the 'merge' incremental strategy. dlt should either use an incremental method that doesn't have this problem or deduplicate automatically.

Steps to reproduce

This might not be straightforward to reproduce, since it depends on the specific timestamps of the tickets. Ping me on Slack and I can screenshare if that's helpful! You might be able to check manually with something like Postman: look for tickets that appear both at the end of one page and at the beginning of the subsequent page.
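The manual check could also be scripted. The sketch below walks the pages of an export and reports ticket ids that show up on two consecutive pages. The helper name `find_cross_page_duplicates` and the injected `fetch_page` callable are hypothetical. The response fields `tickets`, `next_page` and `end_of_stream` are assumptions about the time-based export response and should be checked against the Zendesk docs.

```python
from typing import Any, Callable, Dict, List, Optional, Set, Tuple


def find_cross_page_duplicates(
    first_url: str,
    fetch_page: Callable[[str], Dict[str, Any]],
    max_pages: int = 50,
) -> List[Tuple[int, Any]]:
    """Return (page_index, ticket_id) for ids repeated from the previous page.

    fetch_page is any callable that takes a URL and returns the decoded
    JSON body of one export page (assumed keys: tickets, next_page,
    end_of_stream).
    """
    duplicates: List[Tuple[int, Any]] = []
    previous_ids: Set[Any] = set()
    url: Optional[str] = first_url
    for page_index in range(max_pages):
        if url is None:
            break
        body = fetch_page(url)
        ids = {ticket["id"] for ticket in body.get("tickets", [])}
        # Any id shared with the previous page is a boundary duplicate.
        for ticket_id in sorted(previous_ids & ids):
            duplicates.append((page_index, ticket_id))
        previous_ids = ids
        url = None if body.get("end_of_stream") else body.get("next_page")
    return duplicates


# Offline stand-in for the API, with hypothetical data.
fake_api = {
    "page-1": {"tickets": [{"id": 1}, {"id": 2}],
               "next_page": "page-2", "end_of_stream": False},
    "page-2": {"tickets": [{"id": 2}, {"id": 3}],
               "next_page": "page-3", "end_of_stream": True},
}
found = find_cross_page_duplicates("page-1", fake_api.__getitem__)
```

Against a real account, `fetch_page` would be a small wrapper around an authenticated HTTP GET that returns the parsed JSON, and `first_url` would be the first page of the incremental tickets export.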

How you are using the source?

I run this source in production.

Operating system

Linux

Runtime environment

Virtual Machine

Python version

3.11

dlt destination

iceberg + filesystem (minio)

Additional information

No response
