Ubuntu :: Some Japanese Text Comes Out As Garbage Characters
Jan 23, 2010I have installed scim and anthy. Most Japanese characters display, but some websites and files show garbage characters. Is there any way to resolve this?
View 1 RepliesI have installed scim and anthy. Most Japanese characters display, but some websites and files show garbage characters. Is there any way to resolve this?
View 1 RepliesI have installed scim and anthy. Most Japanese characters display, but some websites and files show garbage characters.
View 1 Replies View RelatedActually I want to log a bug but I don't really know what package to log it against. The problem is that by default Pango is choosing the AR PL UMing CN as the font to render Japanese text when the current font doesn't have Japanese glyphs. But AR PL UMing CN is a Chinese font, so Chinese glyphs for kanji characters (e.g., 覚) are displayed. This is jarring and confusing for Japanese readers.
This situation mostly arises when you have mixed English and Japanese text. Some applications (for instance Firefox) will allow you to select a font for Asian text. Thus if the text contains only Asian characters it will use the font you select, rather than what Pango would have selected. But if it is a mix of English and Japanese, you end up with the wrong glyphs.
Other environments (like gnome-terminal, or a gedit) have difficulties as well. Since the primary interface requires mono spaced roman characters you run into difficulty selecting fonts. Most Japanese fonts only have proportional roman characters. This means that if use a nice roman font and use Japanese text (for instance file names), you end up with Chinese glyphs. What I want is a mechanism that will work across all of Gnome for selecting the font I want to use for Chinese characters. That way I can choose either Japanese or Chinese glyphs.
I realize this is low priority. It only bugs me a little, but many of my Japanese colleagues are put off from using Ubuntu because they are confused by the Chinese glyphs that pop up on my screen from time to time. As I said, I'd like to file a bug, but I'm not sure against what package...
The code below prints garbage if I use puts() but it is fine if I use printf() instead (see sample of output at the bottom of this post). The odd thing is that the mess always starts after exactly eight properly printed characters. That sort of regularity can't be a coincidence, can it? It almost looks like an encoding issue (I'm using UTF-8) but if it really is, then I don't understand why the printf() is unaffected.
Code:
#include <stdio.h>
#include <string.h>
const char *nChars (const char *, int);
int main (int argc, char * argv[]){
char * name ="Count Dracula++";
int len = strlen(name);
if (argv[1] == 0) argv[1]="printf";
printf("--->%d characters", len); .....
I have the Japanese language pack installed and I have ibus and anthy installed for input method management. It all works fine and dandy. Except that some kanji aren't right. Like 社会 the first of the two kanji displayed for me is the archaic version. I can't seem to figure out why this is happening. I can only assume it's picking up data from the wrong font package, but I'm not sure how to manage this. Happens in ibus for my own input and on websites like www.jisho.org where my input was unrelated.Using a fairly fresh install of ubuntu 10.04 lucid, I used scim for IM in Karmic but still stuck to the default japanese language support pack. Worked fine until lucid.
View 7 Replies View RelatedDebian won't display Japanese characters properly, it shows them as symbols. Is there a language pack or a particular browser plugin I need to install? It's sort of a noobish question, but I looked for something related to this issue in my Package Manager, and didn't find anything that seemed suitable/related.
View 2 Replies View RelatedI just installed CentOS 5.2. I have both fonts-japanese and fonts-chinese installed. But I cannot see characters displayed correctly. All Chinese and Japanese characters are displayed as blocks of hexadecimals, except Japanese kana. How can I make them displayed correctly?
*** Appendix 1: /etc/X11/xorg.conf ***
# Xorg configuration created by system-config-display
Section "ServerLayout"
Identifier "single head configuration"
Screen 0 "Screen0" 0 0
InputDevice "Keyboard0" "CoreKeyboard"
InputDevice "Synaptics" "CorePointer"
EndSection .....
I recently intalled Debian lenny and I'm having issues with some of the unicode characters. Instead of displaying the symbols properly it shows one of the following depending on font/app:
1) Square outline with four letters/numbers arranged inside
2) Just a blank square outline
3) Just a blank space
I haven't been able to test all possible characters, but from a quick check it seems that Cyrillic works properly, Japanese doesn't.A few Google searches later and I'm no wiser on how to fix the issue. Any help?
I use the below perl program csv2xls.pl can convert a csv to xls file , it works fine , but I found that it only work for all English characters , I tried to use it to convert a csv file ( with Japanese characters ) , it does work , I also tried the perl "unicode_utf16_japan.pl" , it also did not work , can advise what can i do ?
View 13 Replies View Relatedif I open Yast2 via SSH I see the garbage (code page related issue; similar to this). If I set some other than UTF-8 in PuTTY (KOI8-U, iso-8859 etc) Yast looks good, but then the same problem appears with Midnight Commander. How can I have Yast to use 'right' UTF-8 instead of any legacy code pages? Or how better deal with the situation?
View 2 Replies View RelatedI am facing a peculiar issue when printing Japanese text through CUPS (though I am not sure if this is the right forum).
I developed a Java application (that uses a graphical object to print to a PrinterJob class) that prints text (of Japanese characters) to a printer. When I login in en_US/en_UK locale, the Japanese characters get printed from my Java app just fine. However, when I login in ja_JP and give a print job through my Java app, no Japanese text is printed at all. I get characters from only within the ASCII subset printed instead. I am using Serif and Courier New fonts in my app. Relevant details are:
In ja_JP,
a@a:/usr/share/cups/charsets$ fc-match serif:lang=ja
ttf-japanese-mincho.ttf: "Sazanami Mincho" "Regular"
a@a:/usr/share/cups/charsets$ fc-match sans serif:lang=ja
ttf-japanese-gothic.ttf: "VL Gothic" "regular"
[Code]...
I need to be able to switch between English and Japanese text when typing on my computer but I have no idea how to. I figured it out on both Windows and Mac but Ubuntu is giving me some trouble. I figure IBus is where I'm supposed to be going but even though I selected Japanese, it just isn't working.
View 2 Replies View RelatedBefore I upgraded to Karmic, I had Japanese text input set up on my system, but since the upgrade it no longer works. I can still read Japanese text, and when I press Ctrl+SPACE I still get the ANTHY box in the bottom right-hand corner, but Japanese is no longer on the list of supported languages. how to get Japanese back?
View 1 Replies View Relatedscim-anthy seems to have been installed perfectly... however, ctrl-space or any other combinations that i'm use to don't activate it... i see the keyboard icon, i can go in set up the environment in it but... i can't get the japanese language bar to show up so I can type in Japanese.
View 1 Replies View Relatedi have a problem, when i open OO, the text of the menus is replaced by little rectangles.
For example, instead of:
File Edit View etc..
(they look more like rectangles than squares) i also have this problem with drop down menus, font names, font styles and even with the numbers on the ruler.
Ubuntu 9.10, OO 3.1
I have an odd request, lets say I have a text file like this:
Quote:
Code:
__##___##___IIIIII
__##___##_____II__
__##___##_____II__
__#######_____II__
__##___##_____II__
__##___##_____II__
__##___##___IIIIII
I want a script that will replace all of the hash tabs with random characters.I found this command will generate a random character
Quote:
apg -a1 -n1 -m1 -x1 -d
how to script a search and replace using that random character upon each successive find until the end of file occurs.
I wish to show text between two characters with PHP. Here is some example text:
<p class="abc">My</p>
<table border="1">
<tr><td>name</td><td>is</td></tr>
[code]...
I have a number of text files (26 per database x 100+ databases) which need 'correcting' in order to import into postgresql. I think that I have identified all the problem characters and I need to automate the process as much as possible. I have a script to convert the characters and I do them one by one (not effecient but easier to understand).
[Code].....
I can't find the reason cups pdf driver do this. I created my default printer as is told:
CUPS PDF--> Generic Printer--> Generic CUPS-PDF Printer
The pdf created shows well, but when i drag to select and copy text, it copies some strange characters as following:
✁✂✂✄
☎✆✝✞✟✟✝✆✠✡☛ ☞✌☛✍✡✎ ✏ ☛✄✑✄ ✎✞✟✄✂✒✞✓ ✔✁✓✁ ✕✖✞✑✗✓✞ ✍✁✂✂✄
☎✆✝✞✟✟✝✆✜✁ ✢✖✟✝✖✛✁ ✘✞✂ ✘✖✣✞✓✄ ✏ ✤✁✛✥ ✜✁✦✟✄✣☎✘✄✛
✁✂✂✄
☎✆✝✞✟✟✝✆✕✞✛✓✞✝✄✟ ✘✞ ✙✣✁ ✑✞✣✝✞ ✖✣✝✞✂✖✧✞✣✝✞★ ✍✞✛✣✖✛✁✂✂✄
I configured cups and I managed to print successfully with generic text only but my problem is that the printout always comes out as 80 columns. anything more than 80 colums wraps on the next page. I tried changing the pagesize at the textonly.ppd but it didn't work.my textonly.ppd is
*PPD-Adobe: "4.3"
*%
*% Text-only printer definition
[code]...
I am using a VRS for ftp operations. I have bought it with OpenSuSe 11.3 installed without any Control Panel installed. I have used NX and installed IceWM as windows manager. It works really great. Then system wanted me to upgrade. I did so. After upgrade to 11.4, my programs screwed up. They don't show any text at all but squares instead. I have tried everything i know to fix this but i could not manage to repair it.
My VRS has 512mb RAM , OpenSuse 11.4 (32-Bit) and has no control panel. I use yast through SSH Telnet client (Putty).
When viewing a particular text file in vim or less on Linux or OS X, all the en dashes show up as highlighted "<97>" characters.What control-sequence do I need to type in order to substitute the hyphens back? For example, the following doesn't work in vim:% s/<97>/--/gTyping bracket nine seven bracket is not the same as typing the actual special character.
View 2 Replies View Relatedfor example
else {
for fileDOC in $location/*.doc
do
[code]...
What command could I use in terminal to delete all ASCII characters? That is, delete a-z, A-Z, 0-9, and all punctuation? I have a file containing Chinese characters, and I want to remove everything else and leave just the Chinese.
I can use grep to leave only the lines that have Chinese in them, but this still leaves a lot of non-Chinese stuff on those lines. Does anyone know how I could actually remove everything that isn't Chinese?
While modifying the definition of my PS1, I saw that "[" and "]" markers should be added to help bash to compute the right display lenght. Many exemples on the web do not use them or even mention them.I searched for a solution to add them automatically, like with sed, but I didn't find any example.Are they still needed and is there a recommandation not to use sed to define PS1?
View 1 Replies View RelatedI have my OpenSuse 11.1 box set up with utf-8, however, every time I try to open a file with utf-8 characters with vi it can't handle those characters properly.
View 6 Replies View RelatedI am trying to get a new box up. It has XP on one HDD. There is a second HDD, upon which I want to install Ubuntu and dual boot. My problem is that my CD drive is garbage and won't run the installer. Nor do I have any flash drives. Is it possible to install directly from Windows but not "within Windows" persay? If so, how is this done? (I do have Daemon Tools.)
View 2 Replies View RelatedWith 10.04 final i still have the issue with a very distorted screen. Works with RadeonHD driver but performance is utterly crap so no good solution. Also when adding new panels they become invisible and only a logout and login makes them visible around the border. Errors somehow related? How do i file a bug report?
View 1 Replies View RelatedI was wondering if it is possible to have a garbage collector in C, and would it be too difficult to create?
View 1 Replies View RelatedCode:
char retBuffer[100];
recv(sock , retBuffer ,sizeof(retBuffer) , 0);
if i send a string smaller then my buffer size retBuffer always have garbage of the end of it....what can i do to eliminate tis problem ?