Issues with character encoding please help?

by Moto » Wed, 21 Apr 2010 12:10:17 GMT


Sponsored Links
 I get an array of bytes and I convert that to a String in the
following way:
blockString = new String(tagBlock);

The results are good for english and some other languages but Russian,
Spanish, Chinese, and many others aren't displayed properly...  The
only way to diplay correctly is to find the correct encoding and
decode the language like this:
blockString = new String(tagBlock,"ISO8859_1");

But how can I do this better?  How can I not have to worry about
string encoding?  It's turning out to be a problem back in Japan and
other countries with my app :(

Thanks for any help!
-Moto!

--



Issues with character encoding please help?

by Moto » Wed, 21 Apr 2010 19:47:27 GMT


 I'm sure I'm doing something wrong!?



>


Sponsored Links


Issues with character encoding please help?

by Moto » Thu, 22 Apr 2010 08:42:10 GMT


 I guess no one runs into this kind of issues...???




> >



Issues with character encoding please help?

by Bob Kerns » Thu, 22 Apr 2010 15:09:27 GMT


 No, it's just that I've already told everyone else on the planet (it
seems) not to do this! I've been crusading on this for decades... :=)

Do not ever use the String(byte[]) constructor.

Do not ever pass up an opportunity to explicitly supply a character
encoding.

Always use UTF-8 when you have control over the encoding. But Never,
ever, leave it to the platform default. You must always find out the
encoding somewhere.

If it really is just a bunch of bytes of unknown encoding, and you
can't change that, then you will need to allow the user to make an
alternative selection -- but start with UTF-8 as the default if at all
possible.

Nobody, anywhere, should be using anything but UTF-8 for new data
these days. The world has moved far beyond all these petty national
encodings.

So the first thing is to ask -- where do these bytes come from? If
they're not already in UTF-8, can I change something to get them in
UTF-8? If not, can I get the supplier of these bytes to tell me what
encoding they're in (such as the Content-Type: HTTP file)? If not, how
can I let the user tell me the encoding and character set, if I guess
wrong?





> > >



Other Threads

1. Archos 5 porting group

For all those who showed interest in the Android port to the "old"  
Archos 5, there is now a google group you can join at;

2. Anyone with contacts at GetJar?

If anyone has any contacts at GetJar can they pass on the message that  
its' been discovered that AppsLib are using their download hosting  
service possibly without their knowledge or agreement (they're also  
doing it to us and AndAppStore and to SlideME, neither of which they  
have agreements with either).

If they want some proof, tell them to download 
http://files.appslib.com/db/Releases.dat 
  (the file AppsLib uses to hold release information), and search for  
getjar, they'll then see all the URLs of their that are being used.

I tried using their contact form and kept telling me my message was  
too long even when it was < 100 characters :(.

Thanks,

Al.
--

* Looking for Android Apps? - Try http://andappstore.com/ *

======
Funky Android Limited is registered in England & Wales with the  
company number  6741909.

The views expressed in this email are those of the author and not  
necessarily those of Funky Android Limited, it's associates, or it's  
subsidiaries.


--~--~---------~--~----~------------~-------~--~----~

3. OnCompletion video notification on MediaController and VideView usage?

4. Bug in dalvik? Application will not load on package change

5. Instant Refresh on CursorAdapter attached to a ListView

6. BufferedReader encoding and non-english chars

7. Why Surface postBuffers doesn't work fine sometimes, thanks your reply