ORC file
Apache ORC is a columnar file format that provides optimizations to speed up queries. It is a far more efficient file format than CSV or JSON.
For more information, see ORC Files.
Options
See the following Apache Spark reference articles for supported read and write options.