Search found 1220 matches

by Maël
18 Apr 2020 21:51
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Bidi Algorithm and ScriptLayout Span is an inline element (which cannot cross block/paragraph boundaries), but it can word-wrap. For example this HTML-code: <html><head></head><body> <div>The following span is an <span style=" background-color:#ee3;">inline element</span>; its background has been c...
by Maël
12 Apr 2020 09:27
Forum: Feature and Enhancement Requests
Topic: Please follow up to questions! Unclarity = unlikely to implement
Replies: 0
Views: 2225

Please follow up to questions! Unclarity = unlikely to implement

Please subscribe to topics you post, and make sure you get notifications, such that follow up questions/comments can be answered.

Unclear requests cannot be implemented.

The level of engagement also influences my motivation to put work into implementing a feature.
by Maël
12 Apr 2020 04:59
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

HxD has no notion of a paragraph, but ScriptItemize() from Uniscribe requires an entire paragraph for proper bidi analysis. We cannot simply ignore bidi text either, nor Directional Formatting Characters occuring in a string, since it will affect the rendered text, especially if we split a longer te...
by Maël
11 Apr 2020 17:27
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Practical line limits due to Uniscribe Breaking up text into lines needs to consider an important limit imposed by Uniscribe. From https://sourceforge.net/p/scintilla/bugs/1129/?limit=10&page=2 : In Mozilla's source code in Uniscribe related code there's this comment, which fits perfectly: "Any ite...
by Maël
10 Apr 2020 04:24
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

https://docs.microsoft.com/en-us/windows/win32/intl/uniscribe-glossary#abc-width This link explains ABC width, and would allow to deal with the overhanging H problem Ἧ where the leading decoration is clipped, when normal TextOut is used. Test if ABC width for a text line would include this overhang,...
by Maël
10 Apr 2020 03:52
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Line-wrapping ("word" wrapping -- really grapheme cluster wrapping) Independently of RTL text, line wrapping while respecting grapheme cluster boundaries, may cause rendering of more characters per line, than bytes (hex pairs) per line, since grapheme clusters that are begun on one "hex line" will ...
by Maël
10 Apr 2020 03:52
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

The logical order of characters in a file is displayed as left to right in a hex editor, also for RTL text. So while increasing file offsets for LTR text mean increasing horizontal pixel offsets, for RTL text, increasing file offsets mean increasing horizontal pixel offsets from right to left (or de...
by Maël
09 Apr 2020 19:50
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Understanding bi-directional text https://www.iamcal.com/understanding-bidirectional-text/ There is a file order of code points (and the grapheme clusters they "agglomerate to") and a display/"user reading" order of these "characters". For Arabic text, which is usually RTL, the codepoints will have...
by Maël
08 Apr 2020 20:53
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Side-node: UTF-8 Everywhere is not a good solution While doing research I stumbled once again on the UTF-8 everywhere wish: http://utf8everywhere.org std::string and char* variables are considered UTF-8, anywhere in the program. Among many reasons why UTF-8 does not solve the issues of transparentl...
by Maël
08 Apr 2020 18:23
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

More interesting comments from James Brown from catch22: Well I'm still confused about this whole thing :-) I'm now looking at Word 2003 and notice that it has the same behaviour as Notepad. I am using the following Unicode code-point from the Arabic script: 064a 064f 0633 0627 0648 0650 064a I have...
by Maël
08 Apr 2020 17:50
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Interesting problem for Arabic text: More on cursor movement وِوِوِوِوِوِوِوِوِوِوِوِ Notepad (in current Windows versions, at least 8 and up) behaves correctly, regarding backspace and delete key, as you would expect: delete key removing an entire grapheme cluster, backspace each codepoint/combinin...
by Maël
08 Apr 2020 17:48
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 62
Views: 20408

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

From http://archives.miloush.net/michkap/archive/2006/05/31/611340.html: Fun wrappers around text rendering like Uniscribe, TabbedTextOutW, and DrawTextExW, will end up being treated in XP SP2, Vista, and other recent platforms as a complex script, while both the simpler (e.g. SetWindowTextW) and lo...
by Maël
05 Apr 2020 06:56
Forum: Feature and Enhancement Requests
Topic: Edit -> Select Block... (Ctrl+E)
Replies: 3
Views: 1099

Re: Edit -> Select Block... (Ctrl+E)

I suppose you mean selecting backwards, i.e., set the start offset, then select x bytes before that.

This could be achieved by allowing negative numbers for Length.

Again, please give an example for the numbers of end-offset and length and what behavior you would expect.
by Maël
02 Apr 2020 22:36
Forum: Feature and Enhancement Requests
Topic: Edit -> Select Block... (Ctrl+E)
Replies: 3
Views: 1099

Re: Edit -> Select Block... (Ctrl+E)

Hello,
I don't understand what you mean. What's wrong with the way it works now? Please give an example.
by Maël
02 Apr 2020 20:06
Forum: Support
Topic: Upgrade from 1.7.7.0 to 2.4
Replies: 1
Views: 910

Re: Upgrade from 1.7.7.0 to 2.4

The change log lists all the relevant changes since, version 1.7.7.0.
https://mh-nexus.de/en/hxd/changelog.php#v_2.4.0.0

The installable version has the advantage that you can integrate it in the context menu.