Skip to main content

Questions tagged [unicode]

Unicode is intended to be a universal character set for describing all the characters required for written text incorporating all writing systems, technical symbols and punctuation.

6votes
3answers
930views

A String View Library in C

I was recently working through the PintOS projects and became curious if there was a better way to do some string processing in C. Specifically, instead of strtok_r,...
Alex Lopez's user avatar
6votes
1answer
745views

Transcoding UTF-8 to UTF-16-LE in VBA

VBA is a language that's lacking a lot of basic functionality. (Pun intended) Most libraries, if they exist in the first place, are OS-specific, and even some of the inbuilt functions don't work on ...
GWD's user avatar
  • 195
0votes
1answer
179views

supporting unicode in rust lexer

I am working on a lexer in rust. It has been through many different iterations, going from iterating over a Vec<char> to ...
xxxxxxxxxx's user avatar
6votes
1answer
2kviews

Client server communications through unix signals in C

For school, I have to create a server-client communication through Unix signals and only with SIGUSR1 and SIGUSR2. Client ...
dieri's user avatar
1vote
2answers
176views

Parse string using character value for Unicode characters

I added the getCValue method to reduce the cyclomatic complexity, but the cyclomatic complexity still persists. How can I reduce it? Can I change this code using ...
stromboli's user avatar
5votes
1answer
474views

Converting IDN domains to Punycode in Perl

Description This script takes any domain input from STDIN and converts unicode domains into punycode. Features Any domains that throw an error get ignored. When fed any ASCII domains, they just pass ...
T145's user avatar
  • 3,147
4votes
2answers
780views

UTF-8 to UTF-16 using C++20 concepts and ranges

A follow-up for this previous question. I took into account previous reviews, and tried to make a simple API. I had never done anything non-trivial with C++20 concepts and ranges until now, so I am ...
Aykhan Hagverdili's user avatar
6votes
4answers
3kviews

UTF-8 to UTF-16 (char8_t string to char16_t string)

Below is an implementation of a UTF-8 string to UTF-16 string. Kind of like MultiByteToWideChar on Win32, but it's cross-platform and ...
Aykhan Hagverdili's user avatar
3votes
1answer
149views

Appending a codepoint to an UTF8 std::string using icu4c

My code is ...
Alexey Romanov's user avatar
10votes
2answers
1kviews

Text based rendering/animation engine for the terminal

This project was highly inspired by the popular drawille project, that lets one draw to the terminal using the braille unicode characters. The advantage of drawing with braille characters compared to ...
766F6964's user avatar
8votes
1answer
4kviews

JavaScript string to Unicode (Hex)

This set of questions are related to a project I've published for converting characters, or strings, to Hex based Unicode; eg... ...
S0AndS0's user avatar
1vote
1answer
2kviews

The conversion from UTF-16 to UTF-8

I have created a function that converts from UTF-16 to UTF-8. This function converts from UTF-16 to codepoint firstly, then from codepoint to UTF-8. ...
Lion King's user avatar
22votes
6answers
4kviews

Transcode UCS-4BE to UTF-8

Below is my entire program. You can read what it does thanks to the comments and specifications in particular. My question is: can it be improved? Would it be possible, for example, to avoid writing a ...
lettomobile's user avatar
8votes
1answer
430views

Printing Colored, CLId Unicode Chess Board - follow-up

Follow up to: Printing Command Line Unicode Chess Board A couple people from the previous question mentioned looking into coloring the command line with ANSI escape sequences. In Python, ...
T145's user avatar
  • 3,147
12votes
3answers
8kviews

Printing Command Line Unicode Chess Board

Let me preface this by detailing some of my design philosophy. The intent is to view the board from the perspective of the active player. Player moves are also factored into the design. When ...
T145's user avatar
  • 3,147

153050per page
close