Bookmark and Share  

Unifier FAQ

The content of my file is corrupted after conversion. Why?

The most probable reason is that you have incorrectly specified the character-set encoding of the source file. Please check the encoding of the source file and try conversion again.

What is the filename of the backup and converted files?

Please refer to Specify Output Option Topic in online help

What is the difference between Western(Windows), Western(DOS) and ISO8859-1?

All of these encodings are based on ASCII. The first 127 characters are the same and they are standard English character, digit and symbol. The remaining 128 characters are different on Windows and DOS platform. If your text file is created by DOS applications, you should select Western(DOS). Similarly, if your text file is created by Windows applications, you should select Western (Windows).

ISO8859-1 is the default character in HTML standard. It consists of 256 code points (0-255). The first 256 Code Point of Unicode are exactly same as ISO8859-1. If you select ISO8859-1, the code point of all characters in the source remains the same after conversion. But the code point is stored in 16-bit Unicode Format or 8-Bit UTF-8 Format.

What is GB18030 encoding?

 Please read this page from Microsoft GlobalDev Web Site. There is plenty of information about GB18030 encoding.

Which Unicode Format should I use?

The short answer is that UTF-8 format should be used on HTML-based file only (i.e. HTML, ASP and PHP). All other files should use Unicode (UTF-16) Format.

Can I use UTF-16 format on HTML files?

Yes. But according to HTML standard, HTML files should be transmitted in Big-Endian format and BOM (Byte Order Mark) is recommended at the beginning of the documents. But Windows is a Little-Endian platform. Thus, to avoid the complexity of byte order conversion, it is recommended to use UTF-8 encoding on HTML files.

How do I convert lots of files in a folder (and its sub-foler)?

You may use the built-in File Search function. It helps you to find out all relevant source files in a folder and its sub-folder. You may use wildcard filter or regular expression to specify the search criteria. Then, you may add those files to Source File List.

Can I use wildcard characters to specify filename in unattended mode?

Yes, wildcard character is supported. Unifier will try to do the searching and convert all the files which match the wilcard filter.

How do I get more information about Unicode?

The best way is to visit Unicode Consortium Web Page

What is the format of Unifier Log File?

Unifier running on NT-based platform save log file in Unicode Text File.

How do I edit Unicode Text file on my system?

If you use Windows 2000 or XP, the Notepad and Wordpad application already support Unicode properly. If you use Windows NT 4.0, the Notepad application does recognize Unicode Text Files. But the system may be unable to display all characters unless a Unicode-based font has been installed and selected in Notepad. For Windows 95/98/ME, third-party editor is required to use Unicode. Our Web Site has listed some good Unicode Editors.

Internet Explorer 4 or later supports Unicode properly on all Windows Platforms. You may simply drop your text file to IE to view the file.