From Newsgroup: alt.comp.os.windows-11
...w¡ñ§±¤ñ wrote:
On 3/11/2026 5:19 PM, Maria Sophia wrote:
Maria Sophia wrote:
Or, keeping what Winston had intended
My intent has only slightly changed.
My original intent for nntp was the message header, From field and
signature using the same content.
=> ...w¡ñ§±¤ñ
But a few years ago, I changed my email Display name(in Thunderbird's Address book contact for my posting gmail address) to ...winston to accomodate email messages(non-nttp, to members of a private(html/text supported) group and a managed Google group)
As a result, even though my ETSept account 'Your name' entry settings in Tbird are configured the same as the signature...when viewing my ETSept posts in Thunderbird shows the From field as ...winston (for both email
and news messages) using the address book Display name field.
=> Unlike my other nttp clients(SeaMonkey, and WLM12) which continue
to show nntp messages using the character set(for the From and signature).
While there have been comments(for over a decade) about what others see
in the nntp header and From field...the bottom line for me - wysiwyg.
- didn't change it in the past, not going to change it now.
Though, it good to see that you found a work-around for your newsreader.
--
...w¡ñ§±¤ñ
Hi Winston,
Thanks for the explanation. It helps to see how Thunderbird, SeaMonkey,
and WLM12 each treat the W¡ñ§±¤ñ¬Ößóògîë display name differently.
That explains why you see "...winston" in some places and the decorated version in others. Thunderbird is basically overriding your NNTP identity
with whatever is in its address book, whether you asked it to or not.
I have never understood character encoding. Most of what I know has been
taught to me by servers throwing errors and turning my messages into
modern art.
The only reason I cared about your font-salad is that other people asked me
to fix it when my own outgoing messages were munged by news servers as a result of whatever is in the body of my messages that servers didn't like.
The good news is that switching my outgoing headers to ASCII seems to
have stopped the mojibake that people were reporting. Now that I think I
know what happens for some servers when raw CP1252 bytes get sent in a
UTF-8 declared message, the server behavior finally starts to makes sense.
At least now I know why some of my replies came back wearing Big-5 or ISO-8859-1 labels they never left home with.
And yes, I am glad I found a workaround for my newsreader. Nothing about
your posting style needs to change from my perspective. I just needed to
stop the nntp servers in my path from trying to "fix" things for me.
Here's my latest shortcuts.xml but it's a beast with more comments than a legacy Fortran IV codebase that NASA used for launching a toaster in space.
<?xml version="1.0" encoding="UTF-8" ?>
<!-- C:\app\editor\txt\N++\shortcuts.xml for Windows Notepad++ (N++) -->
<!-- Automatically cleans fragments, converts to ASCII & copies to clipbrd -->
<!-- Use model: Control+V (paste) & Control+B (run the macro) -->
<!-- Notepad++ does not treat shortcuts.xml as a user-editable file. -->
<!-- It treats it as an internal state file. -->
<!-- If you hand-edit it, Notepad++ will rewrite it, reorder it, -->
<!-- truncate it, or revert it to whatever it feels like doing. -->
<!-- Which means you need to always save a text copy of every version -->
<!-- grep -nP "[^\x00-\x7F]" shortcuts.xml (will show non-ASCII comments -->
<!-- v5p3 20260311 added more rules -->
<!-- v5p2 20260311 fixed the trailing tab in "<tab>this is text<tab>." bug -->
<!-- v5p1 20260311 fixed the leading tab in "<tab>this is text<tab>." bug -->
<!-- v5p0 20260311 added fullwidth punctuation normalization -->
<!-- (U+FF01 U+FF08 U+FF09 U+FF3B U+FF3D U+FF5B U+FF5D) -->
<!-- (U+FF0E U+FF0C U+FF0B U+FF0D U+FF1D U+FF3F) -->
<!-- v4p9 20260311 added combining marks (U+0304, U+030B, U+0332) -->
<!-- v4p7 20260311 fixed Tangsa U+16C00 and U+16C20 once again -->
<!-- v4p6 20260311 fix the problem that it doesn't always work -->
<!-- Scintilla may preserve CF_HTML metadata even after cutting -->
<!-- Force the clipboard to contain plain text by doing a 2x copy -->
<!-- Copy cleaned text, Insert dummy char, Delete dummy char -->
<!-- Select all, Copy again -->
<!-- So clipboard contains only the 2nd copy, which is always text-->
<!-- Fixed inaccuracies in the Scintilla-command comment section -->
<!-- Also fixed two Tangsa blocks U+16C00 and U+16C20 which did nothing prior -->
<!-- v4p5 20260311 fix the problem that a space is deleted -->
<!-- But Scintilla 2326 had deleted the character before the caret -->
<!-- So, instead of Insert space, Delete space before the caret -->
<!-- Insert a newline, Move caret left, Delete the newline -->
<!-- v4p4 Replace U+FFFD (REPLACEMENT CHARACTER) with nothing & collapse-->
<!-- Merged letters around U+00A1 (inverted exclamation mark) -->
<!-- v4p3 Replace U+FF1A (FULLWIDTH COLON) with ASCII ':' -->
<!-- v4p2 Replace U+00D7 (MULTIPLICATION SIGN) with ASCII 'x' -->
<!-- v4p1 20260213 Convert Tangsa UTF-8 (F4 86 B0 80) to a placeholder -->
<!-- v4p0 (Scintilla changes this file so always save a text backup! -->
<!-- v3p9 20260211 N++ was not running the macro in the order shown -->
<!-- But it turned out any error causes an OLDER version to run. -->
<!-- Worse, when that happens, N++ overwrites this file -->
<!-- Worse, N++ is executing macro actions in a different order -->
<!-- than they appear in the XML so a total rewrite is needed in v4p0 -->
<!-- v3p8 20260211 U+2060 is driving me nuts so it's the first block now -->
<!-- v3p7 20260211 moved U+2060 up because it's the most disruptive -->
<!-- v3p6 20260211 U+2009 & U+200B not being converted properly -->
<!-- v3p5 20260211 fixed U+200B failing when U+200B is between ' & s -->
<!-- A 2nd pass was duplicated after apostrophe normalization rules -->
<!-- v3p4 20260211 added U+275E (heavy double quote right) -->
<!-- v3p3 20260211 added U+2009 (thin space) -->
<!-- v3p2 20260211 added seven new conversions after running testcases -->
<!-- U+02BE (modifier letter right half ring) -->
<!-- U+02BF (modifier letter left half ring) -->
<!-- U+201E (double low-9 quote) -->
<!-- U+201F (double high-reversed-9 quote) -->
<!-- U+275D (heavy double quote left) -->
<!-- U+275E (heavy double quote right) -->
<!-- U+2015 (horizontal bar) -->
<!-- U+2009 (thin space) -->
<!-- v3p1 20260211 reorganized into a dozen distinct categories -->
<!-- (1) control characters: U+000F U+0001 -->
<!-- (2) dashes & minus signs: U+2010 U+2011 U+2012 U+2212 -->
<!-- (3) zero-width characters: U+200C U+200B U+200D U+FEFF U+2060 -->
<!-- (4) special spaces: U+00A0 U+2007 U+202F U+200A U+2008 U+2006 -->
<!-- (5) apostrophe-like characters:
U+0F0C U+2018 U+2019 U+2032 U+02BC U+02B9 U+02C8 U+02EE
U+201B U+02CB U+A78C U+FF07 -->
<!-- (6) combining marks (remove after apostrophes):
U+0351 U+0307 U+0331 U+0335 U+0336 U+0337 U+0338 -->
<!-- (7) double-quote normalization: U+201C U+201D -->
<!-- (8) dash-like & ellipsis & HTML entities:
U+2026 - U+2014 U+2013 - -->
<!-- (9) bullets, math symbols, diacritics:
U+2022 U+8722 U+011F U+2009 U+00E1 U+0161 U+011B -->
<!-- (10) miscellaneous symbols:
U+2713 ASCII hyphen ' U+2192 U+00B0 U+00A9 U+2122 U+00AE -->
<!-- (11) invisible operators:
U+00AD U+2061 U+2062 U+2063 U+2064 U+180E -->
<!-- (12) line separators: U+2028 U+2029 U+0085 -->
<!-- v3p0 20260211 added combining marks U+0351 U+0307 U+0331 -->
<!-- v3p1 20260211 added apostrophe-like characters U+201B U+02CB -->
<!-- v2p9 20260211 moved U+2060 to be above apostrophe-related blocks -->
<!-- v2p8 20260211 fixed Chromium CF_HTML paste control+A anomaly -->
<!-- v2p7 20260211 added U+02EE modifier letter double apostrophe rule -->
<!-- v2p6 20260211 fixed U+02C8 modifier letter vertical line) rule -->
<!-- v2p5 20260211 fixed U+02B9 (modifier letter prime) rule -->
<!-- v2p4 20260211 removed one of two U+000F blocks -->
<!-- v2p3 20260211 removed two (duplicate) 1700 lines in U+0161 -->
<!-- v2p2 20260211 fixed all zero-width blocks to replace with nothing -->
<!-- v2p1 20260211 fixed BOM to replace with nothing -->
<!-- v2p0 20260210 cleaned (emptied out) closing sections of the file -->
<!-- v1p9 20260210 ported old shortcuts.xml to improve coverage -->
<!-- Cleans Chromium pasted text & normalizes Unicode to ASCII -->
<!-- Use model: paste (using control+v) & fix (using control+b) -->
<!-- The macro should 1st break CF_HTML fragment mode (so Ctrl+A works) -->
<!-- and then run the Unicode-to-ASCII cleanup on all the pasted text -->
<!-- cutting (control+x) the result back into the Windows clipboard -->
<!-- thereby leaving the N++ GUI empty & ready for the next paste-->
<!--
To break Scintilla's CF_HTML fragment mode, we need to make any edit.
We can insert a space & then delete that space, for example.
<Action type="0" message="2001" wParam="32" /> (insert a space)
<Action type="0" message="2326" /> (delete a char to the left)
But that is dependent on the position of the cursor, which changes!
So, in v4p4, instead of Insert space, Delete space before the caret
Insert a newline, Move caret left, Delete the newline
-->
<!-- Scintilla message-id engine command meanings:
1601 = set the search string (the Unicode character to find)
1602 = set the replacement string (ASCII equivalent)
1625 = clear the replacement buffer
1700 = begin a new search/replace operation
1701 = end this search/replace block
1702 = execute Replace All
2001 = Replace the current selection with the given text
2013 = SCI_SELECTALL (selects everything)
2177 = SCI_CUT (cut all)
2304 = Move caret one character to the left
2326 = SCI_DELETEBACK (delete the character to the left of the caret)
2327 = Delete the character to the right of the caret
41001 = IDM_FILE_EXIT (close)
-->
<!-- When you paste from a Chromium-based app, the clipboard contains:
CF_UNICODETEXT (plain text) & CF_HTML (HTML fragment)
And sometimes CF_RTF where N++ prefers CF_HTML if available.
v2p0 fixes a N++ selection issue caused by CF_HTML pastes.
"HTML Paste Mode" prevents the "Control+A" from working.
"HTML paste mode" inserts HTML fragment as plain text
where Ctrl+A is disabled until the buffer is "normalized"
(until the first edit that breaks the fragment state)
-->
<!-- BELOW IS THE START OF THE NOTEPAD++ SHORTCUTS.XML FILE -->
<!-- Below is garbage that N++ adds to shortcuts.xml -->
<NotepadPlus>
<InternalCommands>
<Shortcut id="43009" Ctrl="no" Alt="no" Shift="no" Key="0" />
</InternalCommands>
<Macros>
<!-- Above is garbage that N++ adds to shortcuts.xml -->
<!-- ASCII "control+b" Cleanup Macro -->
<Macro name="ASCII" Ctrl="yes" Alt="no" Shift="no" Key="66">
<!-- Begin Scintilla HTML-paste workaround top portion -->
<!-- Break Chromium CF_HTML fragment mode by adding & deleting a line -->
<!-- Insert newline -->
<Action type="0" message="2001" wParam="10" lParam="0" sParam="" />
<!-- Move caret left -->
<Action type="0" message="2304" wParam="0" lParam="0" sParam="" />
<!-- Delete character to the right -->
<Action type="0" message="2327" wParam="0" lParam="0" sParam="" />
<!-- Select all text before running cleanup -->
<Action type="0" message="2013" wParam="0" lParam="0" sParam="" />
<!-- End Scintilla HTML-paste workaround top portion -->
<!-- BEGIN STANDARD CONVERSION BLOCKS -->
--- Synchronet 3.21d-Linux NewsLink 1.2