This was the first utility I wrote in Python, back when I was learning the basics before going deep into the Databricks world.
This program joins all the CSV files in a folder into a single file.
The default folder is “csv” and the default output file is “joined.csv”.
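The core idea can be sketched roughly as follows. This is only an illustration, not the actual code of joincsv.py: the function name `join_csv_files` is hypothetical, and it assumes every file shares the same header row, which is written once at the top of the output.

```python
import glob
import os


def _ensure_newline(line):
    # Make sure a file's last row does not run into the next file's first row.
    return line if line.endswith("\n") else line + "\n"


def join_csv_files(folder="csv", output="joined.csv"):
    """Concatenate every .csv file in `folder` into `output`.

    Hypothetical helper. Assumes all files share the same header row,
    which is kept only once. Returns the number of files joined.
    """
    output_path = os.path.abspath(output)
    # Sort for a predictable order and skip the output file itself
    # in case it already exists inside the same folder.
    paths = [
        path
        for path in sorted(glob.glob(os.path.join(folder, "*.csv")))
        if os.path.abspath(path) != output_path
    ]

    header_written = False
    with open(output, "w", newline="", encoding="utf-8") as out:
        for path in paths:
            with open(path, newline="", encoding="utf-8") as src:
                header = src.readline()
                if header and not header_written:
                    out.write(_ensure_newline(header))
                    header_written = True
                # Copy the remaining rows as they are.
                for line in src:
                    out.write(_ensure_newline(line))
    return len(paths)
```

Calling `join_csv_files()` with no arguments uses the defaults described above. Rows are copied as plain text lines, so a quoted field that contains a line break still comes through unchanged.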
Available options:
-h --help    Display this help screen.
-f --folder  Sets the folder that contains the files.
-o --output  Sets the name of the output file.
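As a minimal sketch, these options could be declared with the standard `argparse` module. This assumes `argparse` is acceptable here; the original script may parse its arguments differently, and `build_parser` is a hypothetical name. `argparse` provides `-h` / `--help` automatically.

```python
import argparse


def build_parser():
    # Hypothetical helper: declares the options listed above with their defaults.
    parser = argparse.ArgumentParser(
        prog="joincsv.py",
        description="Join all the CSV files of a folder into a single file.",
    )
    parser.add_argument(
        "-f", "--folder", default="csv",
        help="Sets the folder that contains the files.",
    )
    parser.add_argument(
        "-o", "--output", default="joined.csv",
        help="Sets the name of the output file.",
    )
    return parser


# Parse an explicit argument list instead of sys.argv, for illustration.
args = build_parser().parse_args(["-f", "myfiles", "-o", "joinedfile.csv"])
print(args.folder, args.output)  # prints: myfiles joinedfile.csv
```

In a real script, `parse_args()` would be called without arguments so that it reads the command line.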
Usage example:
py joincsv.py -f myfiles -o joinedfile.csv
Based on these articles: