CcsidGuesser
A utility for guessing the CCSID of files (based on file contents).
It can also fix CCSID tags and/or convert files to UTF-8!
Usage
Usage: java -jar ccsidguesser.jar [options] <file>
Valid options include:
--show=top/topN/all: how many CCSID guesses to show (default is 'top1'), which shows the
top 1 result. A value of 'top' shows the top guess and some number
of very-near guesses.
--format=<format>: output format (default is 'ccsid'). See valid formats below.
--convert=<mode>: convert file to UTF-8 (default is 'none'). See valid modes below.
--autofix automatically and unapologetically change the CCSID tag of the file
to match the top guess for the file's contents (IBM i only)
Valid formats include:
ccsid: Show the CCSID only
enc: Show the CCSID and encoding name
Valid convert modes include:
none : perform no conversion
inplace: Convert the file in-place (creates a .bak with the old contents)
dotutf8: Create a new file that is UTF-8 (extension will be .utf8)
Installation
Simply download the latest ccsidguesser.jar
from the Releases page.
Examples
Show the best guess CCSID for myfile.txt
:
java -jar ccsidguesser.jar myfile.txt
Show the top 5 guesses for the CCSID and their corresponding encoding names:
java -jar ccsidguesser.jar myfile.txt --show=top5 --format=enc
Automatically set myfile.txt
's CCSID tag to match the best guess for the file's contents:
java -jar ccsidguesser.jar myfile.txt --autofix
Convert myfile.txt
to UTF-8 "in place":
java -jar ccsidguesser.jar myfile.txt --convert=inplace