feat(storage): log additional bytes received from GCS in read path by nidhiii-27 · Pull Request #17423 · googleapis/google-cloud-python

nidhiii-27 · 2026-06-11T09:40:50Z

It has been found that GCS can occasionally send additional bytes while reading from stream. This scenario should be logged properly for debugging and tracking purposes.

Fixes: 475824752

gemini-code-assist

Code Review

This pull request introduces warning logs when more bytes are downloaded than requested from GCS, implemented in both _download.py and blob.py, along with a new unit test. The review feedback highlights several critical and medium-severity issues: an undefined headers variable in _download.py that will cause a NameError, a missing function call to _get_headers in blob.py that will fail at runtime, and a flawed unit test that does not actually execute the download loop due to incorrect mocking of download.finished and consume_next_chunk. Additionally, suggestions are provided to ensure fallback values are correctly applied when bucket or object names are None, and to align the range-handling logic in blob.py with _download.py for consistency.

It has been found that GCS can occasionally send additional bytes while reading from stream. This scenario should be logged properly for debugging and tracking purposes. Fixes: 475824752 [Generated-by: AI]

nidhiii-27 · 2026-06-11T10:23:14Z

Addressed all PR review comments.

[Generated-by: AI]

chandra-siri · 2026-06-24T11:37:25Z

@@ -66,6 +66,8 @@ def __init__(
        end=None,
        headers=None,
        retry=DEFAULT_RETRY,
+        client_info_bucket_name=None,


add these params in the doc string as well.

Done

Co-authored by AI Agent

chandra-siri · 2026-06-24T11:57:14Z

+            if self.start is not None and self.start < 0 and self.end is None:
+                requested_length = -self.start
+            elif self.start is not None and self.end is not None:
+                requested_length = self.end - self.start + 1
+            elif self.start is None and self.end is not None:
+                requested_length = self.end + 1


this if else logic , can be made more readable a

if start is not None and end is not None: len = end - start + 1 elif start is None: len = end + 1 elif start < 0: len = -start

and what about start > 0 ?

@nidhiii-27

Done. I have refactored the logic to make it cleaner and also handled the start > 0 with end is None case, which requests the remaining bytes from start to the end of the file.

Co-authored by AI Agent

[Generated-by: AI]

chandra-siri · 2026-06-24T17:12:14Z

@@ -66,6 +70,8 @@ def __init__(
        end=None,
        headers=None,
        retry=DEFAULT_RETRY,
+        client_info_bucket_name=None,


you're introduced this new param , but no caller is passing these values. @nidhiii-27

Good point. I have reverted all changes (reverted the warning logging and the unused client_info parameters) in _download.py and consolidated the warning logging logic inside blob.py after the download loop. I also expanded the requested_length calculation in blob.py to handle all range request cases correctly.

Co-authored by AI Agent

chandra-siri · 2026-06-24T19:30:35Z

instead of doing changes in this files, why not after these lines -

google-cloud-python/packages/google-cloud-storage/google/cloud/storage/blob.py

Lines 1123 to 1124 in 8cb77d9

while not download.finished:

download.consume_next_chunk(transport, timeout=timeout)

? ?

you don't have to pass bucket_name and object_name ?

Good point. I have reverted all changes (reverted the warning logging and the unused client_info parameters) in _download.py and consolidated the warning logging logic inside blob.py after the download loop. I also expanded the requested_length calculation in blob.py to handle all range request cases correctly.

Co-authored by AI Agent

…nload.py to blob.py [Generated-by: AI]

…c readability [Generated-by: AI]

…in blob.py [Generated-by: AI]

nidhiii-27 requested a review from a team as a code owner June 11, 2026 09:40

gemini-code-assist Bot reviewed Jun 11, 2026

View reviewed changes

nidhiii-27 marked this pull request as draft June 11, 2026 09:46

nidhiii-27 added ai-generated storage-feature-parity labels Jun 11, 2026

feat(storage): log additional bytes received from GCS in read path

ad8a555

It has been found that GCS can occasionally send additional bytes while reading from stream. This scenario should be logged properly for debugging and tracking purposes. Fixes: 475824752 [Generated-by: AI]

nidhiii-27 force-pushed the feat/log-additional-bytes branch from d7a9e73 to ad8a555 Compare June 11, 2026 10:23

Fix python formatting

bb8cd28

parthea assigned nidhiii-27 Jun 11, 2026

nidhiii-27 marked this pull request as ready for review June 16, 2026 05:50

fix(storage): log warning on byte count mismatch in gRPC bidi reads

d0dc047

[Generated-by: AI]

nidhiii-27 force-pushed the feat/log-additional-bytes branch from b09b205 to d0dc047 Compare June 24, 2026 08:34

chandra-siri requested changes Jun 24, 2026

View reviewed changes

fix(storage): address reviewer comments on media download.py

8038940

[Generated-by: AI]

chandra-siri requested changes Jun 24, 2026

View reviewed changes

nidhiii-27 added 3 commits June 25, 2026 14:26

fix(storage): move warning logging of extra bytes from low-level _dow…

22c166e

…nload.py to blob.py [Generated-by: AI]

fix(storage): revert grpc bidi warning changes and improve range logi…

01ffd42

…c readability [Generated-by: AI]

refactor(storage): restore original nested structure for range logic …

88606e3

…in blob.py [Generated-by: AI]

	while not download.finished:
	download.consume_next_chunk(transport, timeout=timeout)

Uh oh!

Conversation

nidhiii-27 commented Jun 11, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nidhiii-27 commented Jun 11, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants