csvkitHow to remove duplicates with csvkit?
csvkit is a suite of command-line tools for working with CSV data. It can be used to remove duplicates from a CSV file.
Example code
csvdedupe input.csv output.csv
Output example
Processing input.csv
Writing output.csv
The code above will take the input file input.csv and remove any duplicate rows, writing the output to output.csv.
Code explanation
csvdedupe: the command to run the deduplication processinput.csv: the input file containing the data to be deduplicatedoutput.csv: the output file where the deduplicated data will be written
Helpful links
More of Csvkit
- How to reorder columns with csvkit?
- How to rename a column with csvkit?
- How to update specific records using csvkit?
- How to install csvkit on a Mac?
- How to convert a tsv file to csv with csvkit?
- How to specify a delimiter in csvkit?
- How to convert a csv file to tab delimited with csvkit?
- How to change the encoding in csvkit?
- How to split a csv file with csvkit?
- How to install csvkit on Debian?
See more codes...