this post was submitted on 26 Nov 2025
12 points (100.0% liked)

Linguistics

1768 readers
14 users here now

Welcome to the community about the science of human Language!

Everyone is welcome here: from laypeople to professionals, Historical linguists to discourse analysts, structuralists to generativists.

Rules:

  1. Instance rules apply.
  2. Be reasonable, constructive, and conductive to discussion.
  3. Stay on-topic, specially for more divisive subjects. And avoid unnecessary mentioning topics and individuals prone to derail the discussion.
  4. Post sources when reasonable to do so. And when sharing links to paywalled content, provide either a short summary of the content or a freely accessible archive link.
  5. Avoid crack theories and pseudoscientific claims.
  6. Have fun!

Related communities:

Resources:

Grammar Watch - contains descriptions of the grammars of multiple languages, from the whole world.

founded 2 years ago
MODERATORS
 

For example, Latin language and Roman numerals. Are people likely to mistake numbers like ID (499) or VIM (994) for words, or is it always clear enough what's what from context? I think Hebrew, Braille, and maybe Greek also do this, though i'm not as familiar with those scripts as with Latin.

I ask because i'm making a conlang and having a little trouble coming up with enough letters, let alone numerals too. Reusing letters would be helpful and probably not confusing if i make sure that numbers are never pronouncable as letters, but making this easy to read is important to me.

top 6 comments
sorted by: hot top controversial new old
[–] Nemo@slrpnk.net 6 points 1 month ago

It's usually clear from context, and also allows for some hilarious number puns.

[–] FishFace@piefed.social 6 points 1 month ago

I believe 499 would have been written CDXCIX. 994 should be written CMXCIV. You only use a single letter prefix to subtract (hence VIii not IIX) and you only use the two symbols prior to the one you're subtracting from, so you can subtract C or D from M but not X or VI.

This kept complexity of parsing down but also produced fewer forms that could be confused with real words!

[–] lvxferre@mander.xyz 2 points 1 month ago (1 children)

Besides context, like Nemo said, sequences of letters that are legit for numbers are often invalid for common words. For example, in Latin you'll never see words with "MC" (because */mk/ is phonotactically forbidden), and yet you'll see it for plenty numbers. Or "III", it pops up all the time for numbers but almost never for actual words.

On the conlanging part: if you're using an alphabet, and your phonotactics prevent consonant-only words (pretty common restriction), you can ensure the numbers are obvious as numbers by using only consonants. Another alternative would be to create one grapheme to prefix numbers with; like, instead of writing "ID" you'd write "#ID".

[–] IndigoGollum@lemmy.world 2 points 1 month ago

Most consonant-only syllables are forbidden, so i think i will use the non-syllabic consonants as numerals.

[–] Eccentric@sh.itjust.works 1 points 1 month ago

There isn't really a scientific measure of how confusing something is linguistically that I am aware of. As you and other people have pointed out, there are plenty of languages that don't differentiate orthographically between numbers and letters. It's like asking whether people will confuse read and read. Yes, they probably will in some cases, but on some level speakers understand that these words are written identically and that you require context to figure it out. So they would know the difference much more often than not

[–] mechanismatic@lemmy.world 1 points 1 month ago

You might follow the pattern of braille and have a prefix symbol that indicates the following letters are numbers. ⠼ functions like a number or hashtag symbol in front of letters A - J to indicate they're to be read as numbers, 1 - 9 & 0, not letters.