Search found 1211 matches

by Maël
19 May 2020 14:14
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

from https://www.adobe.com/products/type/adobe-type-references-tips/glossary.html Terminology Face or font face or typeface Adobe: One of the styles of a family of faces. For example, the italic style of the Garamond family is a face. CSS: https://developer.mozilla.org/de/docs/Web/CSS/@font-face Pro...
by Maël
18 May 2020 17:36
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Filtering out invalid text, e.g., for Thai, is not done by default by Uniscribe and just a possibility (from http://archives.miloush.net/michkap/archive/2010/12/18/10106789.html ): The code that "filters" these characters sits in code called by the EDIT control that checks for two things: Does the s...
by Maël
18 May 2020 17:35
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Charmap or simple style rendering For the charmap or simple style rendering, a solution could be to use ScriptShape with psa->s.fDisplayZWG = True to reliably know if glyphs are missing for a codepoint and also handle precomposed characters that might be rendered with several glyphs. No need to loo...
by Maël
18 May 2020 17:06
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

So when considering this whole chaos, the solution is a bit complicated. Neither ScriptShape nor ScriptGetCMap will properly look up a glyph (or glyph sequence) for each codepoint. And GetGlyphOutline or GetGlyphIndicies are limited to the BMP. We have two goals: Finding missing (=default) glyphs in...
by Maël
18 May 2020 16:14
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Reliably detecting if a codepoint can be represented by a font After reading many articles/posts/bug reports, including larger parts of the OpenType specification and font rendering techniques, and inspecting and testing font files with "DTL OTMaster 3.7 Light" (and some other font tools), I finall...
by Maël
17 May 2020 14:07
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Clusters in shaping <> grapheme clusters from https://harfbuzz.github.io/clusters.html#clusters-and-shaping In text shaping, a cluster is a sequence of characters that needs to be treated as a single, indivisible unit. A single letter or symbol can be a cluster of its own. Other clusters correspond...
by Maël
17 May 2020 08:50
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Possibly, this may allow to coax Uniscribe to merely use a direct character code to glyph mapping using the cmap table, given SCRIPT_UNDEFINED is used, to completely disable complex shaping. But it's not completely clear if this holds. It may also be possible to enable Uniscribe to do some glyph sub...
by Maël
17 May 2020 08:36
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 11425

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

from https://docs.microsoft.com/en-us/typography/opentype/spec/ttochap1#text-processing-with-opentype-layout-fonts Text processing with OpenType Layout fonts A text-processing client follows a standard process to convert the string of characters entered by a user into positioned glyphs. To produce t...
by Maël
13 May 2020 09:46
Forum: Feature and Enhancement Requests
Topic: Data inspector: round floats to precision/significant digits
Replies: 7
Views: 1458

Re: Data inspector: round floats to precision/significant digits

Thanks to Rick Regan from exploringbinary.com, who wrote a very useful article on converting decimal strings to floating point numbers (with the best possible accuracy) , I have a better understanding how this process works in principle, using BigIntegers. He also kindly pointed me to an article fro...
by Maël
12 May 2020 08:39
Forum: Feature and Enhancement Requests
Topic: Data inspector: round floats to precision/significant digits
Replies: 7
Views: 1458

Re: Data inspector: round floats to precision/significant digits

clang 10 supports float80 on x86-64 (not sure if Win64, too, or just Linux):
https://godbolt.org/z/_edvbT
by Maël
12 May 2020 07:38
Forum: Bug Reports
Topic: Datainspector: byte $C0 in Int8 row (in hexadecimal number mode) should not display as ffffffc0
Replies: 2
Views: 2558

Re: Datainspector: byte $C0 in Int8 row (in hexadecimal number mode) should not display as ffffffc0

Make sure this matches with IntToBase/IntToOffsetBase handling of negative values, i.e., both the data inspector hex value display and the former functions should give the same results, for consistency. This is still left to be done. From the experience gained through https://github.com/maelh/hxd-p...
by Maël
11 May 2020 23:37
Forum: Feature and Enhancement Requests
Topic: Japanese encoding support (EUC-JP and SHIFT-JIS)
Replies: 1
Views: 105

Re: Japanese encoding support (EUC-JP and SHIFT-JIS)

Thanks for filing the bug, as mentioned this can be implemented once Unicode rendering / editing is fully supported: https://forum.mh-nexus.de/viewtopic.php?f=4&t=1004
by Maël
11 May 2020 22:46
Forum: Bug Reports
Topic: Can't search for 64-bit integer
Replies: 2
Views: 130

Re: Can't search for 64-bit integer

Fixed in the development version. The search uses the same code as the datainspector now to convert from text to an integer. It supports also (U)Int24 integer types, includes (U)Int24 types in its search, if you choose "any" bitwidth, and searches for both signed or unsigned numbers. The latter is p...
by Maël
11 May 2020 20:25
Forum: Bug Reports
Topic: Can't search for 64-bit integer
Replies: 2
Views: 130

Re: Can't search for 64-bit integer

Thanks for the bug report. The first is a bug, the second is expected since it searches for signed integers. It should be extendable to search for both, signed integers and unsigned integers, without having to specify the signedness, due to how two's complement encoding works. However with the cavea...