Questions tagged [unicode]
Unicode is a universal character set intended to cover every character needed for written text, including all writing systems, technical symbols, and punctuation.
77 questions
6 votes
3 answers
930 views
A String View Library in C
I was recently working through the PintOS projects and became curious if there was a better way to do some string processing in C. Specifically, instead of strtok_r,...
6 votes
1 answer
745 views
Transcoding UTF-8 to UTF-16-LE in VBA
VBA is a language that's lacking a lot of basic functionality. (Pun intended) Most libraries, if they exist in the first place, are OS-specific, and even some of the inbuilt functions don't work on ...
0 votes
1 answer
179 views
supporting unicode in rust lexer
I am working on a lexer in Rust. It has been through many different iterations, going from iterating over a Vec<char> to ...
6 votes
1 answer
2k views
Client server communications through unix signals in C
For school, I have to implement client-server communication through Unix signals, using only SIGUSR1 and SIGUSR2. Client ...
1 vote
2 answers
176 views
Parse string using character value for Unicode characters
I added the getCValue method to reduce the cyclomatic complexity, but the cyclomatic complexity still persists. How can I reduce it? Can I change this code using ...
5 votes
1 answer
474 views
Converting IDN domains to Punycode in Perl
Description: This script takes any domain input from STDIN and converts Unicode domains into Punycode. Features: Any domains that throw an error get ignored. When fed any ASCII domains, they just pass ...
4 votes
2 answers
780 views
UTF-8 to UTF-16 using C++20 concepts and ranges
A follow-up for this previous question. I took into account previous reviews, and tried to make a simple API. I had never done anything non-trivial with C++20 concepts and ranges until now, so I am ...
6 votes
4 answers
3k views
UTF-8 to UTF-16 (char8_t string to char16_t string)
Below is an implementation of a UTF-8 string to UTF-16 string. Kind of like MultiByteToWideChar on Win32, but it's cross-platform and ...
3 votes
1 answer
149 views
Appending a codepoint to an UTF8 std::string using icu4c
My code is ...
10 votes
2 answers
1k views
Text based rendering/animation engine for the terminal
This project was highly inspired by the popular drawille project, which lets one draw to the terminal using the Braille Unicode characters. The advantage of drawing with Braille characters compared to ...
8 votes
1 answer
4k views
JavaScript string to Unicode (Hex)
This set of questions is related to a project I've published for converting characters, or strings, to hex-based Unicode; e.g. ... ...
1 vote
1 answer
2k views
The conversion from UTF-16 to UTF-8
I have created a function that converts UTF-16 to UTF-8. It first decodes the UTF-16 input to a code point, then encodes that code point as UTF-8. ...
22 votes
6 answers
4k views
Transcode UCS-4BE to UTF-8
Below is my entire program. You can read what it does thanks to the comments and specifications in particular. My question is: can it be improved? Would it be possible, for example, to avoid writing a ...
8 votes
1 answer
430 views
Printing Colored, CLId Unicode Chess Board - follow-up
Follow-up to: Printing Command Line Unicode Chess Board. A couple of people from the previous question mentioned looking into coloring the command line with ANSI escape sequences. In Python, ...
12 votes
3 answers
8k views
Printing Command Line Unicode Chess Board
Let me preface this by detailing some of my design philosophy. The intent is to view the board from the perspective of the active player. Player moves are also factored into the design. When ...