Winter Rant

… now that Twitter is dead.

Across the Fediverse: Testing Tags

As I was writing my previous post on how tags with spaces (and dashes and underscores) were federating, I wanted to test this more systematically. So, this is that post. It is merely a test balloon to assess current behavior.

I will be editing this post with the “Results” in the section below. But let me start with the parameters of the Experiment Setup and Hypothesis. Here’s the outline for the rest of the post:

Experiment Setup

Here are all the tags that I am trying…

It will be curious to see in the different ways in which this blows up (or works out) 😁🤣

For what it’s worth, this is how it shows up in my WordPress blog post preview:

At least WordPress does not seem to have an issue in displaying all these tags with the special characters (at least in preview mode). Be interesting to see what happens when I publish; and certainly what happens when this federates. 🤞

Hypothesis

All the special characters will get filtered out when this blog post federates. And the character that shows up after the special character will get capitalized.

So, this is what I would expect:

WordPress tagExpected Mastodon Tag
“doubleQuoteTag”doubleQuoteTag
‘quoteTag’quoteTag
backslash/tagbackslashTag
colon:tagscolonTags
dash-tagsdashTags
exclaim!tagsexclaimTags
fullStop.tagsfullStopTags
n–dashTagsnDashTags
plus+tagplusTag
semicolon;tagssemicolonTags
slash/tagslashTag
space tagsspaceTags
underscore_tagsunderscoreTags
quote’tagsquoteTags
doublequote’tagsdoublequoteTags

Results (pending)

First, let me just post the screenshots from both WordPress and Mastodon. Then I will dig into how the results panned out (using my table from the Hypothesis section). Finally, I will callout some conclusions.

Raw Screenshot: WordPress (live, i.e., non-preview)

WordPress stayed true to what it promised in the preview mode (from the screenshot above). It seems to have shown all tags, with the special characters intact. At least, I cannot spot any exceptions.

Raw Screenshots: Mastodon

I grabbed these screenshots soon after this blog post federated over to Mastodon and before I made any edits after-publishing from WordPress.

Here is the full post with the “Read more >” UX. All tags seem to have federated.

Here is another screenshot of the post on Mastodon when I opened to read the full post: all hashtags show up at the bottom of the post (expected behavior at this point for federated WP posts).

Results: Expected vs. Reality

WordPress tagExpected Mastodon TagActual Mastodon tag
“doubleQuoteTag”doubleQuoteTagDoubleQuoteTag_
‘quoteTag’quoteTagQuoteTag_
backslash/tagbackslashTagbackslashTag
colon:tagscolonTagscolonTags
dash-tagsdashTagsdashTags
exclaim!tagsexclaimTagsexclaimTags
fullStop.tagsfullStopTagsfullStopTags
n–dashTagsnDashTagsnDashTags
plus+tagplusTagplusTag
semicolon;tagssemicolonTagssemicolonTags
slash/tagslashTagslashTag
space tagsspaceTagsspaceTag
underscore_tagsunderscoreTagsunderscoreTags
quote’tagsquoteTagsquoteTags
doublequote’tagsdoublequoteTagsdoublequoteTags

Conclusions

  1. WordPress did not drop any tag on the WP blogpost.
  2. No tag was dropped in the federation. All tags show up on Mastodon.
  3. The translation for all tags match my expectations except for “doubleQuoteTag” and ‘quoteTag’ (I talk about that next…)

“doubleQuoteTag” and ‘quoteTag’

In retrospect, I made a mistake here, but I also did not fully expect what actually showed up.

If my mind was working correctly, I should have expected:

  • “doubleQuoteTag”… translates to … DoubleQuoteTag
  • “quoteTag”… translates to … QuoteTag

But surprisingly, in addition to that an underscore got added at the end! What I got instead was DoubleQuoteTag_ and QuoteTag_. I think I might have tugged at a fun corner case here.

Overall, I think my hypothesis holds, expect for trailing special characters:

All the special characters will get filtered out when a WP.com blog post federates. And the character that shows up after the special character will get capitalized (at least in English). But if the special character is a trailing character, then it gets replaced by an underscore (_) character.

– vijay, watching Silicon Valley 📺

One response to “Across the Fediverse: Testing Tags”

  1. […] Another fun note: “software-engineering” on WP became “#softwareEngineering” on Mastodon. So it seems that this #tag translation does not just apply to tags with spaces, but also dashes, and perhaps other special characters(?). (Will test soon! Update: testing done, and special character handling is happening.) […]

    Like

Leave a comment