Fixes for import automation by vish-cs · Pull Request #1969 · datacommonsorg/data

vish-cs · 2026-04-20T07:08:41Z

Fix the condition to check for an empty diff (schema_diff_size)
Add the option to import-helper to run the batch job or the dataflow job
Update spanner client to only return staging imports

gemini-code-assist

Code Review

This pull request refactors the import differ logic to return a summary dictionary and updates the validation workflow to handle this new structure. It also introduces a conditional post-processing step for Spanner ingestion, allowing for status updates to 'STAGING' and corresponding filtering in the Spanner client. I have no feedback to provide.

ajaits · 2026-04-20T10:30:19Z

+            logging.info("Marking import as SKIP due to no data diff.")
+            import_summary.status = ImportStatus.SKIP
+        else:
+            import_summary.status = ImportStatus.STAGING


return validation_status
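
The status selection in the hunk above can be sketched in isolation. This is a minimal illustration, not the repository's code: `ImportStatus` and `ImportSummary` here are hypothetical stand-ins that contain only the members visible in the diff, and `set_status` is an invented helper name.

```python
import logging
from dataclasses import dataclass
from enum import Enum


class ImportStatus(Enum):
    # Hypothetical stand-in: only the two members seen in the diff.
    SKIP = 'SKIP'
    STAGING = 'STAGING'


@dataclass
class ImportSummary:
    # Hypothetical stand-in holding only the field used in the diff.
    status: ImportStatus = ImportStatus.STAGING


def set_status(import_summary: ImportSummary, diff_found: bool) -> ImportSummary:
    """Marks the import SKIP when there is no diff, otherwise STAGING."""
    if not diff_found:
        logging.info("Marking import as SKIP due to no data diff.")
        import_summary.status = ImportStatus.SKIP
    else:
        import_summary.status = ImportStatus.STAGING
    return import_summary
```

Calling `set_status(ImportSummary(), diff_found=False)` leaves the summary in `ImportStatus.SKIP`, which matches the log message in the hunk.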

ajaits · 2026-04-20T10:58:54Z

        'import_version',
        datetime.now(timezone.utc).strftime("%Y-%m-%d"))
-    run_ingestion = True 
+    post_process = attributes.get('post_process', '')


Is this a new attribute? How is it used?

The name suggests a step that runs after the import workflow.
Can we rename this to run_process, so that when it is set to spanner_ingestion_workflow it is clear that it only runs a Dataflow ingestion?
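
To make the question concrete, here is a rough sketch of how such an attribute could select a job. Only the `attributes.get('post_process', '')` lookup and the value `spanner_ingestion_workflow` appear in this thread. The function name `select_job`, the returned labels, and the assumption that any other value falls back to the batch job are illustrative guesses, not the PR's behavior.

```python
from typing import Mapping


def select_job(attributes: Mapping[str, str]) -> str:
    """Returns a label for the job to run, based on the message attributes."""
    # Same lookup as in the diff: a missing attribute yields an empty string.
    post_process = attributes.get('post_process', '')
    if post_process == 'spanner_ingestion_workflow':
        # Per the review comment, this value means a Dataflow ingestion only.
        return 'dataflow'
    # Assumption for this sketch: every other value runs the batch job.
    return 'batch'
```

Returning a label instead of launching anything keeps the branch easy to test without cloud credentials.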

ajaits · 2026-04-20T11:07:13Z

                    import_input=import_input,
                    absolute_import_dir=absolute_import_dir)
+                if differ_summary is not None:
+                    diff_found = (differ_summary['obs_diff_size'] != 0 or


Can we use .get() instead of []?
differ_summary.get('obs_diff_size', 0) != 0 or differ_summary.get('schema_diff_size', 0) != 0
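
The suggestion can be sketched as a small helper. The keys `obs_diff_size` and `schema_diff_size` come from the diff; the function name `has_diff` is hypothetical and used only for illustration.

```python
from typing import Mapping


def has_diff(differ_summary: Mapping[str, int]) -> bool:
    """Returns True if the summary reports any observation or schema diff."""
    # .get() with a default of 0 treats a missing key as "no diff"
    # instead of raising KeyError, as indexing with [] would.
    return (differ_summary.get('obs_diff_size', 0) != 0 or
            differ_summary.get('schema_diff_size', 0) != 0)
```

With this form, a summary that lacks one of the keys is handled the same way as a summary that reports a size of zero for it.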

gemini-code-assist bot reviewed Apr 20, 2026

vish-cs requested a review from ajaits April 20, 2026 07:37

vish-cs force-pushed the fix branch from c9f13a4 to e56e6b1 Compare April 20, 2026 08:00

Fixes for import automation

a75a4ce

vish-cs force-pushed the fix branch from e56e6b1 to a75a4ce Compare April 20, 2026 08:38

ajaits reviewed Apr 20, 2026

ajaits approved these changes Apr 20, 2026

ajaits reviewed Apr 20, 2026

vish-cs wants to merge 1 commit into datacommonsorg:master from vish-cs:fix

2 participants
