Search found 1094 matches

by Maël
17 Feb 2019 19:09
Forum: Support
Topic: unicode two bytes chars, how can I enter ?
Replies: 8
Views: 5030

Re: unicode two bytes chars, how can I enter ?

It's planned, as mentioned in the forum thread linked above.
by Maël
17 Feb 2019 18:50
Forum: Support
Topic: unicode two bytes chars, how can I enter ?
Replies: 8
Views: 5030

Re: unicode two bytes chars, how can I enter ?

HxD does not support UTF-8 in the text column, yet. See this thread: https://forum.mh-nexus.de/viewtopic.php?f=4&t=1004 What you can do is write your text in a text editor, save it as UTF-8 without BOM, then open it in HxD. You can copy the relevant byte sequences from this file and paste them into ...
by Maël
17 Feb 2019 17:12
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Combination and Application of Combining Marks have more detail on grapheme clusters and the difference to a combining character sequence, but also specify how to render and handle text segmentation. The grapheme cluster represents a horizontally segmentable unit of text, consisting of some graphem...
by Maël
17 Feb 2019 14:12
Forum: Feature and Enhancement Requests
Topic: HxD: Custom Character Encoding for Text
Replies: 8
Views: 12815

Re: HxD: Custom Character Encoding for Text

Related post "Adding support for Shift-JIS in hex editor HxD": https://www.romhacking.net/forum/index. ... ic=27943.0
by Maël
17 Feb 2019 12:45
Forum: Translations
Topic: Translation software
Replies: 10
Views: 18635

Re: Translation software

The Spanish translation (and 2 other new translations) are available for download now.
by Maël
17 Feb 2019 12:27
Forum: News
Topic: Minor update: HxD 2.2.1
Replies: 0
Views: 2785

Minor update: HxD 2.2.1

HxD 2.2.1 was released.

Here is the changelog and the download.

Currently, UTF-8 and UTF-16 support is designed. If you want to give your input about user interaction and similar wishes, now is your chance:
https://forum.mh-nexus.de/viewtopic.php?f=4&t=1004
by Maël
16 Feb 2019 22:01
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

As in theory there can be arbitrarily many combining characters following a base code point, somehow a reasonable limit must be introduced, especially in a hex editor that might interpret random data, or data that is not actually text. This is important for performance reasons, but also to avoid str...
by Maël
16 Feb 2019 11:16
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings

Editing in place (overwrite) from the text column would be quite problematic in variable-width encodings. I only see it working realiably with 16-bit Unicode. If I type over the composite 'ü' (only Macintosh writes this format ü) and thus write fewer bytes, where would the padding go? There would b...
by Maël
16 Feb 2019 08:57
Forum: Feature and Enhancement Requests
Topic: Color coding with word processor like toolbar for structures like MBRs
Replies: 3
Views: 475

Re: Color coding with word processor like toolbar for structures like MBRs

I noticed that the new Data Inspector already has WideChar decoding, I'd like to see that as one of the choices for the text column, to read more than one symbol at a time. Maybe other fixed width fields from the Inspector could be candidates for a tabular display in a column, with a spin control t...
by Maël
10 Feb 2019 08:23
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Another issue is the Overwrite mode. When a character (in the user sense, so a grapheme cluster) is overwritten, usually a character that has an encoding of the same length (=amount of bytes) is expected. With variable-length encodings this cannot be guaranteed. The most sensible would be to replace...
by Maël
10 Feb 2019 08:19
Forum: Feature and Enhancement Requests
Topic: HxD: Custom Character Encoding for Text
Replies: 8
Views: 12815

Re: HxD: Custom Character Encoding for Text

1A=textstring1 1B=textstring2 Encodings of this format (few bytes that expand to a lot of characters) can be supported if the character string is seen as an unmodifiable symbol. That is, like a normal character, it will be either deleted in its entirety or inserted in its entirety. The individual c...
by Maël
09 Feb 2019 16:24
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

Searching for Shift-JIS (and other MBCS) can be done when the search pattern is converted to the byte sequence following the Shift-JIS encoding, and then searching for the byte sequence, disregarding any character notions. Since there is no notion of characters, but only bytes, it doesn't matter whe...
by Maël
09 Feb 2019 16:00
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)

UTF-16 is self-synchronizing on code units level (16-bit words), but not on byte level. https://en.wikipedia.org/wiki/UTF-16#U.2B010000_to_U.2B10FFFF Since the ranges for the high surrogates (0xD800–0xDBFF), low surrogates (0xDC00–0xDFFF), and valid BMP characters (0x0000–0xD7FF, 0xE000–0xFFFF) are ...
by Maël
09 Feb 2019 11:06
Forum: Feature and Enhancement Requests
Topic: Color coding with word processor like toolbar for structures like MBRs
Replies: 3
Views: 475

Re: Color coding with word processor like toolbar for structures like MBRs

Good example of color coding: https://en.wikipedia.org/w/index.php?title=Shift_JIS&oldid=878196726#Shift_JIS_byte_map Not because all colors are dissimilar enough, but because they are used in large contiguous patches, which makes it easier to distinguish, and more organized. Here is another example...
by Maël
08 Feb 2019 09:37
Forum: Feature and Enhancement Requests
Topic: Text column support for UTF-8, UTF-16 and other multi-byte text encodings (variable width encodings)
Replies: 18
Views: 2418

Re: Text column support for UTF-8, UTF-16 and other multi-byte text encodings

I hope I haven't missed any important properties of typesetting or Unicode that could affect text rendering in hex editors (especially as mentioned in the summary in the post just above this one). Possible points I missed are listed in this table: https://en.wikipedia.org/wiki/Template:Unicode_navig...