C / C++ / MFC

Re: get wide character and multibyte character value

25-Jan-08 3:15

George_George wrote:
For the same wide character string in Czech, I am not sure whether converting to Czech codepage (1250) or CP_UTF8 will making information lose ... i.e. resulting in ?? (0x3F) character.

If you know for sure the wide string contains only Czech (or other Central Europian) characters, you can convert it to either CP1250 or UTF-8 without any loss. In any case, you don't need to mess with the system language settings - just use the correct codepage parameter in WideStringToMultiChar.

If you don't know for sure the wide string contains only Central Europian characters, you can still safely convert to UTF-8, but not to CP1250.

Programming Blog

utf8-cpp

George_George25-Jan-08 3:19

Re: get wide character and multibyte character value

25-Jan-08 3:19

Great Nemanja!

But why not safe with CP1250?

Nemanja Trifunovic wrote:
If you don't know for sure the wide string contains only Central Europian characters, you can still safely convert to UTF-8, but not to CP1250.

regards,
George

Nemanja Trifunovic25-Jan-08 3:26

Re: get wide character and multibyte character value

25-Jan-08 3:26

George_George wrote:
But why not safe with CP1250?

If a wide string contains a non-Central Europian character, say - U+03A8 (Greek capital Psi); it has no representation in CP1250 and will appear as a replacement character (probably question mark) after the conversion.

Programming Blog

utf8-cpp

George_George25-Jan-08 3:30

Re: get wide character and multibyte character value

25-Jan-08 3:30

Thanks for your clarification, Nemanja!

My question is answered.

regards,
George

CPallini24-Jan-08 21:08

CPallini

24-Jan-08 21:08

Nemanja Trifunovic wrote:
here is 1-1 mapping between UTF-16 (wide chars on Windows) and UTF-8 (both are simply different encoding forms for Unicode character set).

You're right indeed.

Nemanja Trifunovic wrote:
On the other hand, there is no 1-1 mapping between Unicode and CP1250.

There is 1-1 mapping for the subset representing Czech characters, I suppose.
Smile | :)

If the Lord God Almighty had consulted me before embarking upon the Creation, I would have recommended something simpler.
-- Alfonso the Wise, 13th Century King of Castile.

[my articles]

Re: get wide character and multibyte character value

Nemanja Trifunovic25-Jan-08 3:17

Re: get wide character and multibyte character value

25-Jan-08 3:17

CPallini wrote:
There is 1-1 mapping for the subset representing Czech characters, I suppose.

Correct. It is just not always easy to be sure that the wide string contains only the Czech subset.

Programming Blog

utf8-cpp

Nemanja Trifunovic24-Jan-08 4:29

Re: get wide character and multibyte character value

24-Jan-08 4:29

George_George wrote:
I need to know the wide character (unicode) and multibyte (UTF-8) values of a character string of czech.

Is this string provided as a user input, or you are dealing with a hard-coded string literal?

Anyway, to get a UTF-8 representation of a Unicode string, there is no need to change the system language. Either use WideCharToMultiByte with codepage set to CP_UTF8, or use a third-party library (like the one you see in my signature Wink | ;)

)

Programming Blog

utf8-cpp

George_George24-Jan-08 14:32

Re: get wide character and multibyte character value

24-Jan-08 14:32

Thanks Nemanja,

Nemanja Trifunovic wrote:
Anyway, to get a UTF-8 representation of a Unicode string, there is no need to change the system language. Either use WideCharToMultiByte with codepage set to CP_UTF8, or use a third-party library (like the one you see in my signature )

1. Change system language settings before invoking WideCharToMultiByte, right?

2. Should I use UTF-8 code page or use some Czech code page?

regards,
George

Nemanja Trifunovic24-Jan-08 14:34

Re: get wide character and multibyte character value

24-Jan-08 14:34

George_George wrote:
1. Change system language settings before invoking WideCharToMultiByte, right?

Don't

George_George wrote:
2. Should I use UTF-8 code page or use some Czech code page?

UTF-8

Programming Blog

utf8-cpp

George_George24-Jan-08 14:58

Re: get wide character and multibyte character value

24-Jan-08 14:58

Thanks Nemanja!

From your reply and CPallini's reply, I am confused.

This is what you mentioned, and you think using UTF-8 code page to convert wide character to multibyte character is ok.

Nemanja Trifunovic wrote:
George_George wrote:
2. Should I use UTF-8 code page or use some Czech code page?

UTF-8

2.

This is what CPallini mentioned,

http://www.codeproject.com/script/Forums/View.aspx?fid=1647&select=2401259&fr=107#xx2401259xx[^]

He said using UTF-8 is not always safe and we should use code page 1250?

regards,
George

Nemanja Trifunovic24-Jan-08 15:23

Regarding bakground services

24-Jan-08 15:23

George_George wrote:
This is what you mentioned, and you think using UTF-8 code page to convert wide character to multibyte character is ok.

What I say is simply the following: Both UTF-16 (stored in "wide characters") and UTF-8 (stored in "multibyte characters") are different encoding forms of the same character set (Unicode) and there is absolutelly no risk of conversion loss between the two. In fact, you don't even need to use WinAPI to do the conversion - just take a look at my utf8 - cpp library to see how it is done manually.

Programming Blog

utf8-cpp

tasumisra23-Jan-08 19:49

tasumisra

23-Jan-08 19:49

Dear all,
we can invoke our VC++ dlls from backgroud services.. like from "services.msc",

i have adoubt when we are writing such services what should be taken care so that it comes under "services.msc" not in "dcomcnfg"

and what is the difference in those two types ....

Thanks in advance

T@SU

Re: Regarding bakground services

led mike24-Jan-08 5:23

led mike

24-Jan-08 5:23

After reading your post several times I have no idea what you trying to say or ask. Since no one else replied to you I imagine others are having similar difficulties understanding you.

led mike

Connect to Unix machine from VC++ code.

cagespear23-Jan-08 17:20

cagespear

23-Jan-08 17:20

I have a requirement where we need to connect to unix machine, run some commands and capture output(on the lines of what "putty" does).

I wanted to know how it can be accomplished through C++ code. We can use SSH, Rlogin connectivity.

Thanks
Amit

Re: Connect to Unix machine from VC++ code.

Maxwell Chen23-Jan-08 18:06

Maxwell Chen

23-Jan-08 18:06

Maybe try some SSH library for Windows. Such as:
http://42.pl/ssh/index_en.html[^]

Maxwell Chen

Re: Connect to Unix machine from VC++ code.

cagespear29-Jan-08 6:18

cagespear

29-Jan-08 6:18

Thanks a ton Maxwell!

It really helped.

How to get file name from a full path

zengkun10023-Jan-08 16:43

zengkun100

23-Jan-08 16:43

Suppose the full path is :
C:\Program Files\codeproject.txt
I want to extract the file name codeproject.txt.
Of course I can use CString class to find the last '\', and then Mid to get the filename, but is there some API can do this daily job?
Thank you Smile | :)

A Chinese VC++ programmer

Maxwell Chen23-Jan-08 16:51

Maxwell Chen

23-Jan-08 16:51

Since you want to use API (Windows SDK), the function below meets your need.
But CFile is even convenient for you though.

BOOL GetFileInformationByHandleEx(
  HANDLE hFile,
  FILE_INFO_BY_HANDLE_CLASS cls,
  LPVOID info,
  DWORD bufsize
);
// ...
FILE_NAME_INFO MyStruct = {0};
bool b = GetFileInformationByHandleEx(hFile, FileNameInfo, &MyStruct, sizeof(FILE_NAME_INFO));

Maxwell Chen

zengkun10023-Jan-08 17:02

zengkun100

23-Jan-08 17:02

OK, Thank you Maxwell Chen Smile | :)

Both of the ways you suggested are convenient. But the question is all of them need a file HANDLE. I just want to get the filename given its fullpath.

A Chinese VC++ programmer

Naveen23-Jan-08 17:01

Naveen

23-Jan-08 17:01

PathStripPath()

nave

[OpenedFileFinder]

zengkun10023-Jan-08 17:05

zengkun100

23-Jan-08 17:05

Thank you nave!
I think I have got the answer:_splitpath Smile | :)

A Chinese VC++ programmer

Abhay Menon23-Jan-08 17:18

Abhay Menon

23-Jan-08 17:18

You can use the CFileFind

CFileFind cf;
cf.FindFile("C:\\Program Files\\codeproject.txt");
cf.FindNext();
CString strFileName = cf.GetFileName();

Rgds
Abhay..

Rajesh R Subramanian23-Jan-08 21:07

Rajesh R Subramanian

23-Jan-08 21:07

zengkun100 wrote:
I think I have got the answer:_splitpath

MSDN Says: _splitpath is deprecated because more secure versions are available, see _splitpath_s, _wsplitpath_s.

Nobody can give you wiser advice than yourself. - Cicero
.·´¯`·->Rajesh<-·´¯`·.
Codeproject.com: Visual C++ MVP