Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[destination-bigquery] Missing tables in the destination during the final steps of sync #40733

Open
1 task
killthekitten opened this issue Jul 4, 2024 · 0 comments
Labels
area/connectors Connector related issues community connectors/destination/bigquery team/destinations Destinations team's backlog type/bug Something isn't working

Comments

@killthekitten
Copy link
Contributor

Connector Name

destination-bigquery

Connector Version

2.6.3

What step the error happened?

During the sync

Relevant information

We have been seeing empty/missing destination tables more often, and the frequency of such incidents increases. During the debug, we narrowed the issue down to these BigQuery jobs:

DROP TABLE IF EXISTS `project-name`.`dataset`.`table_name`;
ALTER TABLE `project-name`.`dataset`.`table_name_airbyte_tmp` RENAME TO `table_name`;

The DROP statement executes instantly, but the ALTER TABLE is taking 10+ minutes to complete and there is no table to query during this time. I am not sure why the ALTER TABLE takes so much time, as this can happen even with small tables, but my guess is that BigQuery's Fair Scheduling policy deprioritizes our queries because of too many interactive jobs.

  1. Is there a reason that DROP-ALTER duo is not atomic? Can it be wrapped into a transaction?
  2. How do long waiting times on this operation affect the job and the instance status? We are seeing intermittent failures on the sync jobs more and more. Could it be that the jobs run into timeouts and get cancelled?
  3. Because there are a lot of failures, we also see inconsistent data state (see my comment here) quite often that is only resolved after two sync attempts.

Relevant log output

No response

Contribute

  • Yes, I want to contribute
@killthekitten killthekitten added area/connectors Connector related issues needs-triage type/bug Something isn't working labels Jul 4, 2024
@killthekitten killthekitten changed the title [destination-bigquery] Empty destination during the time when finalizing the table [destination-bigquery] Missing tables in the destination during the final steps of sync Jul 4, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/connectors Connector related issues community connectors/destination/bigquery team/destinations Destinations team's backlog type/bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants