Semantics

From MgmtWiki

Latest revision as of 21:50, 20 May 2025

Full Title or Meme

When an Entity can determine the "significance" of words in sentences [or data structures].[1]

Context

  • Based on the obsolete idea that all of language can be divided into Syntax (order) and Semantics (meaning).
  • Some data elements have meaning that is independent of the places where they are used. These cases are handled well with a Data Dictionary. See that wiki page for non-context-dependent semantics.
  • For data elements that are context-dependent, the meaning cannot be known without reference to the context. These cases need a richer model.
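
The split between the last two cases can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the element names, contexts, and descriptions below are hypothetical and do not come from any real data dictionary.

```python
# Hypothetical data dictionary: one meaning per element name,
# valid wherever the element is used.
DATA_DICTIONARY = {
    "iso_country_code": "Two-letter ISO 3166-1 country code",
}

# Hypothetical context-dependent meanings, keyed by (context, element name).
CONTEXT_MEANINGS = {
    ("invoice", "date"): "Date the invoice was issued",
    ("shipment", "date"): "Date the goods left the warehouse",
}


def meaning_of(name, context=None):
    """Look up a meaning, consulting the context only when it is needed."""
    if name in DATA_DICTIONARY:
        return DATA_DICTIONARY[name]
    if (context, name) in CONTEXT_MEANINGS:
        return CONTEXT_MEANINGS[(context, name)]
    raise LookupError(f"no meaning for {name!r} in context {context!r}")
```

Here `meaning_of("iso_country_code")` resolves without any context, while `meaning_of("date")` fails until a context such as `"invoice"` is supplied, which is the sense in which the context-dependent case needs a richer model.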

Semantic Models

  • Semantics in data processing goes back to the 1980s and 1990s, when EDI and then XML structures were being defined.
    • In the case of EDI all semantics were context-dependent. The concept of a context definition was introduced by Tom Jones at a meeting at MIT where a data type was defined to say that the following data values had meaning only in that document at that location.
    • In the case of XML, the contexts were established at the top of the document and the specific context tag was prepended to the element name. So if the context name was "x", an element of that context would appear as x:tag.
  • Data Dictionaries appeared in the databases with the EAR or Entity Attribute Relational models used with languages like SQL. In that case the context was the table where the data meaning was described.
  • The Semantic Web can be dated to a paper by Tim Berners-Lee[2] in 2001 and the publication of the OWL language in 2004.[3] That spawned a bewildering collection of data dictionaries that are ostentatiously called Ontologies when they are really just what their authors' employers imagine reality should be.
  • The result is not a collection of words that users of computers understand, but what the advertisers want them to see.
  • An alternate source of Ontologies is an Artificial Intelligence that has determined which terms its training set uses. That might be closer to what the users who created the training set want to know.[4] For an example see WikiData, "a free and open knowledge base that can be read and edited by both humans and machines." It reports: "2021-12-08: "Antanina Paulavičienė", the one hundred and ten millionth item, was created."
  • The above article by Hitzler also notes that "shallow non-expressive schemas often used for linked data appeared to be a major obstacle to reusability and initial hopes that interlinks between datasets would somehow account for this weakness did not really seem to materialize."
  • Knowledge graphs with central control (unlike WikiData) are "usually understood to be much more internally consistent, and more tightly controlled, artifacts."
  • See the wiki page on Semantic Models for more details.
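
The XML convention described above, where a context is declared at the top of the document and its prefix is prepended to the element name, can be tried out with Python's standard library. This is a minimal sketch: the prefix "x" follows the example in the text, while the URI `http://example.com/context-x` and the element contents are made up for illustration.

```python
import xml.etree.ElementTree as ET

# The context "x" is declared once on the root element; the URI is hypothetical.
DOC = """<order xmlns:x="http://example.com/context-x">
  <x:tag>42</x:tag>
  <tag>not in context x</tag>
</order>"""

root = ET.fromstring(DOC)


def split_context(element):
    """Return (context URI or None, local name) for a parsed element."""
    # ElementTree stores a qualified name as "{uri}local".
    if element.tag.startswith("{"):
        uri, local = element.tag[1:].split("}", 1)
        return uri, local
    return None, element.tag


pairs = [split_context(child) for child in root]

# Lookups by prefix need the same prefix-to-URI mapping the document declared.
in_context = root.find("x:tag", {"x": "http://example.com/context-x"})
```

The two child elements share the local name `tag`, but only the first carries the context, so `pairs` tells them apart by URI rather than by the prefix letter itself.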

General Semantics

This is a label for a school of thought originated by Alfred Korzybski in 1933. The core idea is that meanings arise from attempts to describe events or objects in the real world. The label that is used acquires nuances of meaning as it is used by people, but in the end, it is just an attempt by people to categorize a collection of events or objects so that communication is possible. These collections can never be as precise as Plato would have you believe with his ideal forms. An example of the varying meaning of words used for human feelings was published by Batja Mesquita[5] where the idea of a constant set of names for human emotions was shown to be false. The emotions are separately collected by different cultures into different groups with different names.

Problems

  • "If men do not pour new wine into old bottles, they do something almost as bad: they invest old words with new meanings. "Work" and "energy" are venerable English words, but since the Industrial and Scientific Revolutions they have acquired entirely new definitions. They have become more abstract and divorced from directly sensed qualities of human activity; and they have become more precise, finding expression in quantitative units of measurement (foot-pound, erg) and exact scientific laws (Conservation of Energy). The word "energy" uttered in a contemporary setting may represent quite different concepts and thought processes from the word "energy" uttered in the eighteenth century."[6]

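Solutions

  • 2025-04-29 Rethinking Semantics: A Clearer Split for the Future of Data Modeling, Paul Knowles (https://medium.com/@paul_15561/rethinking-semantics-a-clearer-split-for-the-future-of-data-modeling-c94ee025dc87)
  • Controlled efforts like SNOMED-CT (https://www.snomed.org/) have worked well within the healthcare field to create a standardized set of categories. Naturally, as knowledge expands, the list of categories needs to expand as well.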

References

  1. Webster's Third International Dictionary, Etymology of Semantics
  2. Berners-Lee +2, The Semantic Web Scientific American (2001-05-17) https://www.scientificamerican.com/article/the-semantic-web/
  3. W3C, OWL Web Ontology Language Overview (2004-02-10) http://www.w3.org/TR/owl-features/
  4. Pascal Hitzler, A Review of the Semantic Web Field CACM 64 No 2 pp. 76ff
  5. Nikhil Krishnan, Affective Disorder New Yorker (2022-08-08) p. 62ff
  6. H.A. Simon, Designing Organizations for an Information Rich World https://web.archive.org/web/20201006235931/https://digitalcollections.library.cmu.edu/awweb/awarchive?type=file&item=33748