this post was submitted on 04 Jul 2025
163 points (98.2% liked)

Programmer Humor

24772 readers
945 users here now

Welcome to Programmer Humor!

This is a place where you can post jokes, memes, humor, etc. related to programming!

For sharing awful code theres also Programming Horror.

Rules

founded 2 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] wise_pancake@lemmy.ca 4 points 1 day ago (1 children)

I’d probably just use line delimited JSON or CSV for this use case. It plays nicely with cat and other standard tools and basically all the yaml is doing is wrapping raw json and adding extra parse time/complexity.

In the end consider converting this to parquet for analysis, you probably won’t get much from compression or row-group clustering, but you will get benefits from the column store format when reading the data.

[–] qaz@lemmy.world 4 points 1 day ago* (last edited 1 day ago) (1 children)

Thanks for the advice, but this is just the format of some eyetracking software I had to use not something I develop myself

[–] wise_pancake@lemmy.ca 4 points 1 day ago

Ah, well, such is software dependencies.