

This research article examines the benefits of Apache Parquet, a columnar storage format, in enhancing query performance and resource efficiency in big data analytics. It compares Parquet with traditional row-based formats, highlighting its capabilities in data compression, predicate pushdown, and integration with modern analytical engines like Spark and Trino. The study includes benchmarking results, architectural insights, and best practices for data engineers and architects to optimize analyt