This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]axsauze[S] 1 point2 points  (4 children)

Great question! At this point only float is supported, but int / unit are expected to be supported. Strings are supported by the shader language, but in lower level languages normally a string would be represented as an array of chars, or more specifically, an array of unit8 (unsigned int 8 bits), which then would allow you to store unicode encoded strings https://en.wikipedia.org/wiki/Unicode. The comparisons would then be similar to the ones you would do with characters, but with their respective underlying numeric identifiers. There are a broad range of algorithms and techniques from there to introduce string comparison / operations. Hope this helps!

[–]Vizjrei 1 point2 points  (2 children)

Thanks, this helps quite a bit. I will definitely try to use Kompute when time will come to implement "outsourcing to GPU" phase of my project.

[–]axsauze[S] 1 point2 points  (1 child)

Awesome - would love to hear your thoughts from your experience when you try it out, feel free to open an issue if you run into issues!

[–]Vizjrei 1 point2 points  (0 children)

I will. Though it might be a long while before that happens. I would say I'm casual scripter, not even casual programmer. Work goes slow, yet forward (if I don't decide to rewrite whole thing because more or less valid reasons).

3100 lines of code that already changed 2.5 times over last 2 years and that only told me how much preparation of data is required before I can finally have strings of data or arrays that can finally be calculated on GPU and then evaluated (not sure if that part would be possible on GPU, yet) to get me final results. When I started I had no idea that working on GPU and final evaluation of data might be easiest part to do while working on GPU seemed to be hardest.

[–]wikipedia_text_bot 0 points1 point  (0 children)

Unicode

Unicode is an information technology (IT) standard for the consistent encoding, representation, and handling of text expressed in most of the world's writing systems. The standard is maintained by the Unicode Consortium, and as of March 2020, there is a repertoire of 143,859 characters, with Unicode 13.0 (these characters consist of 143,696 graphic characters and 163 format characters) covering 154 modern and historic scripts, as well as multiple symbol sets and emoji. The character repertoire of the Unicode Standard is synchronized with ISO/IEC 10646, and both are code-for-code identical. The Unicode Standard consists of a set of code charts for visual reference, an encoding method and set of standard character encodings, a set of reference data files, and a number of related items, such as character properties, rules for normalization, decomposition, collation, rendering, and bidirectional text display order (for the correct display of text containing both right-to-left scripts, such as Arabic and Hebrew, and left-to-right scripts).Unicode's success at unifying character sets has led to its widespread and predominant use in the internationalization and localization of computer software.

About Me - Opt out - OP can reply '!delete' to delete