- Community Home
- >
- Servers and Operating Systems
- >
- Operating Systems
- >
- Operating System - HP-UX
- >
- UTF-8 charset under HPUX
Categories
Company
Local Language
Forums
Discussions
Forums
- Data Protection and Retention
- Entry Storage Systems
- Legacy
- Midrange and Enterprise Storage
- Storage Networking
- HPE Nimble Storage
Discussions
Forums
Discussions
Discussions
Discussions
Forums
Discussions
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
- BladeSystem Infrastructure and Application Solutions
- Appliance Servers
- Alpha Servers
- BackOffice Products
- Internet Products
- HPE 9000 and HPE e3000 Servers
- Networking
- Netservers
- Secure OS Software for Linux
- Server Management (Insight Manager 7)
- Windows Server 2003
- Operating System - Tru64 Unix
- ProLiant Deployment and Provisioning
- Linux-Based Community / Regional
- Microsoft System Center Integration
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Discussion Boards
Community
Resources
Forums
Blogs
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-10-2007 12:05 PM
12-10-2007 12:05 PM
UTF-8 charset under HPUX
I'm currently dealing with a problem since 3 weeks.
Let me explain ..
CIFS SERVER UNDER HP-UX B.11.23 U ia64
======================================
Configuration File ( /etc/opt/samba/smb.conf ) :
[global]
workgroup = ADHARA
client code page = 850
character set = ISO8859-1
server string = "Samba Server"
log file = /var/opt/samba/log.%m
max log size = 1000
security = user
encrypt passwords = yes
socket options = TCP_NODELAY
local master = no
preserve case = yes
short preserve case = no
dos filetime resolution = yes
read only = no
syslog = 0
( I removed the commented part )
[GEMMA]
comment = GEMMA
path =/tmp/gemma_file
valid users = @smbadm, jjgui
public = no
writable = yes
printable = no
write list = @smbadm, jjgui
======================================
Client :
Microsoft Windows XP ...
French customer
======================================
Problem :
When the guy is sending a file from Windows to the Unix share, then the convertion is bad, there is some strange tabulations represented by a little square.
I received a mail from the customer saying :
"The original format is ANSI ( Unix format ). Once converted with unix2dos command, the file become UTF-8 ( DOS / Windows Format ). It might be enough to set the convertion to UTF-8."
======================================
Unfortunately. I cannot find any codepage for UTF-8 format under HPUX. I've try thoses options :
dos charset = CP850
unix charset = UTF-8
Results :
Ignoring unknown parameter "dos charset"
[2007/12/10 20:00:35, 0] param/loadparm.c:map_parameter(2129)
Unknown parameter encountered: "unix charset"
[2007/12/10 20:00:35, 0] param/loadparm.c:lp_do_parameter(2817)
So, I tried :
client code page = 850
character set = UTF-8
I get an error because the file /etc/opt/samba/codepages/codepage.UTF-8 has not been found
======================================
My questions are :
1- Is there any parameters to set like the dos and unix charset ?
2- Is there any place where I can find the UTF-8 codepage file ?
Thanks in advance for your advice, question ...
David Bourcier
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-11-2007 12:54 AM
12-11-2007 12:54 AM
Re: UTF-8 charset under HPUX
Technically, UTF-8 is an 8-bit encoding of Unicode, not a character set, although Sambe (and others) are a little free with the meaning of "character set".
There can't be a UTF-8 code page because a code page selection switches the top 128 characters in an 8-bit code set (the bottom 128 are the "standard" ASCII set), while UTF-8 uses a variable-length encoding of the full Unicode character set (apologies if I'm telling you something you already know).
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-12-2007 12:25 AM
12-12-2007 12:25 AM
Re: UTF-8 charset under HPUX
Thanks for your really usefull help. So I've upgraded the CIFS server on the HPUX to CIFS-SERVER 3 ( based on Samba 3.0.2 ). The good news is that UTF-8 is now known by Samba, but the customer still have the same problem. Here is the parameter I set :
dos charset = CP850
unix charset = UTF-8
display charset = UTF-8
The parameter client code page is unknown.
Any idea of how I can avoid this problem ? Or even, how to implement a automatic conversion of the file using the program ux2dos when someone is tranferring a file on the share ?
Thanks in advance for your help
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-12-2007 01:30 AM
12-12-2007 01:30 AM
Re: UTF-8 charset under HPUX
As there is no way to automatically determine with 100% certainty which files contain binary data which must not be modified and which are plaintext, Samba won't convert the _contents_ of the files at all.
MK
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-12-2007 07:22 AM
12-12-2007 07:22 AM
Re: UTF-8 charset under HPUX
So, it seems that there is no workaround possible to fix my problem, at least under the samba configuration.
Thanks to all for your precious help, do not hesitate to "ping" me if you have any workaround possible :)
Have a nice Day. Bye
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
12-13-2007 01:35 AM
12-13-2007 01:35 AM
Re: UTF-8 charset under HPUX
When and how does your customer see the "translation"? If he reads the file on the same system it was written, it should be unchanged.
Since he mentions unix2dos, I assume he's writing it on one system (UNIX) and reading it on another (DOS)? Unix2dos should only change the new line sequences (