DBMS Musings: Apache Arrow vs. Parquet and ORC: Do we really need a third Apache project for columnar data representation?
![Performance comparison of different file formats and storage engines in the Hadoop ecosystem | Databases at CERN blog Performance comparison of different file formats and storage engines in the Hadoop ecosystem | Databases at CERN blog](https://db-blog-multimedia.web.cern.ch/db-blog-multimedia/zbaranow/hadoop_formats/chep2016_summary.jpg)
Performance comparison of different file formats and storage engines in the Hadoop ecosystem | Databases at CERN blog
![Big Data File Formats, Explained. Parquet vs ORC vs AVRO vs JSON. Which… | by 💡Mike Shakhomirov | Towards Data Science Big Data File Formats, Explained. Parquet vs ORC vs AVRO vs JSON. Which… | by 💡Mike Shakhomirov | Towards Data Science](https://miro.medium.com/v2/resize:fit:1400/0*INWiFWNRNOvCgDq0.png)
Big Data File Formats, Explained. Parquet vs ORC vs AVRO vs JSON. Which… | by 💡Mike Shakhomirov | Towards Data Science
![The impact of columnar file formats on SQL‐on‐hadoop engine performance: A study on ORC and Parquet - Ivanov - 2020 - Concurrency and Computation: Practice and Experience - Wiley Online Library The impact of columnar file formats on SQL‐on‐hadoop engine performance: A study on ORC and Parquet - Ivanov - 2020 - Concurrency and Computation: Practice and Experience - Wiley Online Library](https://onlinelibrary.wiley.com/cms/asset/2ba3c9f6-d94f-44f3-9dda-11e4fd91ddcb/cpe5523-fig-0006-m.jpg)