Did you know that the English language consists of over 200,000 words? Of these 200,000 words, many have the same meaning but different spelling (synonyms). Some words even have the same spelling but different meanings (homographs). Confusing, right?
Let’s imagine you were talking to your colleagues about your recent trek to the coast and you wanted to explain the faults you saw. Are you talking about errors in a system or geological faults? To further complicate your chat, what if you used ‘faults’ whilst your colleagues use ‘cracks’ to define the same geological phenomena? It’s a wonder we ever understand each other at all.
Figure 1: Fault, with all related synsets and definitions
Generally, we don’t confuse one another, because we’re aware of the different terms for things and often understand the context of both the situation being described and the conversation serving to describe it (context is a whole topic in itself that we’ll address another time).
But every term for a thing (a noun) actually sits within a hierarchical structure that connects every noun to every other noun, from the least generic nouns to the most generic, called an entity.
Of all things, 43.9 percent have multiple nouns to describe the same thing. Whilst 18.7 percent of nouns are homographs — spelt the same, but with differing definitions. Experience teaches us to identify these words and when to use the right one. To look at this structure in detail, you can clearly see just how complicated the network of nouns each of us navigates each day really is.
The hierarchy is made up of synonyms and hyponyms (think of it like a synonym that increases specificity). Direct synonyms are words that have the same meaning, such as fault and crack. Their synset, or sets of synonyms, is fault.n.04, where the first part is the word that relates them, the “n” states it is a noun, and 04 means it is the fourth definition of fault. In this example, the words are found on the same level in the hierarchical structure of nouns, however not all words on the same level are synonyms of each other.
Levels in the structure are based on the generality of the noun. Entity is the most general noun or root hypernym, which all nouns lead up to. If we traverse down the hierarchy, going down to each hyponym, we get to more specific words, such as San Andreas Fault and inclined fault.