What’s your take on Parquet?
I’m still reading up on it. Why is it so closely tied to Apache? Does only Apache push it? Meaning, if Apache drops it, would nobody else be interested in developing it further?
It’s published under the Apache License 2.0, which is a permissive license. Is there a drawback to the license?
Do you use it? When?
I assume that for sharing small data, CSV is sufficient. I also assume CSV is more accessible than Parquet.
How does this differ from GraphQL?
Parquet is a storage format; GraphQL is a query language/transmission strategy.
GraphQL is a protocol for interacting with a remote system. Parquet is about having a local file that you can index and retrieve data from more efficiently. It’s especially useful when the data has a fairly well-defined structure but may be large enough that you can’t or don’t want to bring it all into memory. They’re similar concepts, but with different applications.
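To make the "retrieve data without bringing it all into memory" point a bit more concrete, here is a toy sketch in plain Python. It is not Parquet's actual on-disk format, just an illustration of the general idea of storing each column separately and keeping an index at the end of the file. The function names, the `.toy` file, and the sample data are all made up for the example.

```python
import json
import os
import struct
import tempfile


def write_columnar(path, columns):
    """Write each column as its own chunk, followed by a footer index.

    Toy layout (an assumption for illustration, NOT the Parquet spec):
    [column chunk]... [footer JSON] [4-byte little-endian footer length]
    """
    index = {}
    with open(path, "wb") as f:
        for name, values in columns.items():
            chunk = json.dumps(values).encode("utf-8")
            index[name] = (f.tell(), len(chunk))  # byte offset and length
            f.write(chunk)
        footer = json.dumps(index).encode("utf-8")
        f.write(footer)
        f.write(struct.pack("<I", len(footer)))


def read_column(path, name):
    """Read a single column by seeking to it; other columns stay on disk."""
    with open(path, "rb") as f:
        # The last 4 bytes say how long the footer index is.
        f.seek(-4, os.SEEK_END)
        (footer_len,) = struct.unpack("<I", f.read(4))
        # Load only the footer, then jump straight to the requested column.
        f.seek(-(4 + footer_len), os.SEEK_END)
        index = json.loads(f.read(footer_len))
        offset, length = index[name]
        f.seek(offset)
        return json.loads(f.read(length))


# Hypothetical sample data, written to a temporary file.
path = os.path.join(tempfile.mkdtemp(), "weather.toy")
write_columnar(path, {
    "city": ["Oslo", "Lima", "Pune"],
    "temp_c": [4.5, 19.0, 31.2],
})

temps = read_column(path, "temp_c")  # the "city" bytes are never read
print(temps)
```

With an actual Parquet library the same role is played by a column-selective read, for example `pyarrow.parquet.read_table(path, columns=["temp_c"])`. A CSV reader, by contrast, has to scan every row to pull out one column.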
Thank you!