We can use Amazon S3 Select to retrieve a subset of data from an S3 object using simple SQL statements. Since Amazon S3 Select filters a subset of data, it will help to reduce the data transfer cost and latency of an application. Amazon S3 select supports data stored in CSV, JSON, or Apache Parquet formats.
Refer to the original article that I published on https://duleendra.medium.com