-
Notifications
You must be signed in to change notification settings - Fork 78
Show metadata size #3343
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Show metadata size #3343
Conversation
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #3343 +/- ##
=======================================
Coverage 89.76% 89.76%
=======================================
Files 29 29
Lines 31289 31295 +6
Branches 5737 5738 +1
=======================================
+ Hits 28086 28092 +6
Misses 1794 1794
Partials 1409 1409
Flags with carried forward coverage won't be shown. Click here to find out more.
🚀 New features to boost your workflow:
|
|
Bytes are better. How does this interact with the old "has metadata" flag? What happens when there is metadata but there's no schema (I think I see what it is not from rereading the code, but it would really help to use some variables and not try to make everything maximally cryptic). Some simple unit tests here based on the output of metadata_details covering the various possibilities would be sufficient. |
|
I'll go with bytes then. This replaces the "has_metadata" column (which I think is the right way to go: if there is text in there, it means either a schema is set or there is metadata). I'm trying not to bloat the tables with lots of extra text / columns, so if we are going with bytes, how about No metadata, no schema = So e.g. ? Other suggestions welcome, obviously. I could put e.g. an empty marker (e.g. The alternative is to have 2 columns, labelled e.g. |
|
I think it's confusing to omit the bytes value when it's empty, just say "0 Bytes". I think it's probably easier to understand if we have two columns all right. |

Fixes #2637 - I remembered this issue when writing some recent tutorial material.
I went for the absolute size rather than percentage, and simply appended the codec type, rather than having a separate column, as it seems simpler:
I will add tests if this seems like the right format: