Information Theory Approach
Information Theory Approach
Anything that exists in Real or Mental planes, any element of the largest possible domain of
existence of all times is an extity. Practically every entry you can see in a large dictionary is an extity.
Each extity has a tag, which contains definition, descriptions, attributes and information about the
extity. A tag consists of fields, each of which holds a different piece of, or set of similar pieces of,
information pertaining to the extity. Each field has a name or description.
Let e1 be an extity with the following tag:
Field name Data it holds
Field 1 Name fayv
Field 2 Symbol 5
Field 3 Writing Five, english
Field 4 Descriptions An abstract concept
Field 5 definition A number
Data module: A set of bytes that represents, holds information about or corresponds to an
extity... Data that pertain to several irrelevant extities may be represented by the same dm. (It is a
matter of language and perception, not any mathematical relation. Synonyms are the result of
irrationality of the language concept.)
Of course, it may not be possible to know whether or not a set of bytes is really representing
information pertaining to an extity or it is just a random bunch of bytes unless it has been “registered”
in some way before, and defined. Therefore, in order to know whether or not a set of bytes is a data
module, a registery or definition/description information must be associated with it. That registery
information can be a data module, batch, sequence or an entire information module. Like complex
numbers, each data module consists of a data component and a module tag. The tag is rather a
container, not a peer of data module. As a result, unless otherwise is necessary, data and tag
components shall be processed separately, not as a pair.
Several definitions are possible for a data module, depending on the aspect in focus of an
extity. For example, all of the following are possible definitions for “Philips”:
“a company”
“an electronic company”
“a company that produces monitors”
“an electronic company that produces monitors”
“a big company with 200.000 employees”
The list above are only a few examples. Even a definition that takes several pages is possible,
depending on the intended point. Tag is theoretically an endless set:Just check eksisozluk.com to have
an idea about how diversified definitions of things might be! Especially a number can be defined in
infinitely many ways.
Human brain always stores each data module together with its module tag. The tag is recalled
partially or entirely, depending on the context the extity that the data module represents is perceived.
For instance, if I am to buy a monitor, then the retrieved part of the “Philips” tag is “an electronic
company that produces monitors” or “a company that produces monitors”, and I would not care about
the number of employees Philips has. It is like using the proper meaning of a word from dictionary.
Definition of each entry in a dictionary is a module tag. A word usually has multiple meanings that
together make up the tag. Only part of the tag that corresponds to the needed meaning of the word is
retrieved. Similarly, only a subset of the tag tailored according to the particular context is employed.
That subset is the dynamic tag of the module, as compared to the static tag, which is the universal set
of definitions of the extity. Needless to say, dynamic tag is used in practice. (Rev:It may be
discussed whether or not there exists a default, or primary, dynamic tag, but I’m not gonna
spend time on it for the time being.)
In the meantime, I can define “Philips” as a data module because it has already been defined
and registered in my memory. A dictionary is a structure that consists of common and standard data
modules defined and registered in the same manner, which correspond to commonly known extities.
For instance, a cat is an extity everybody sees around everyday, so it is a commonly known extity.
When one goes to school, one learns that the byte set “cat”, which is made up of three bytes,
corresponds to the extity cat. At that point, “cat” is registered as a data module that represents the
commonly known extity cat. Regarding the previous paragraph, I already know that “Philips” is the
name of a company, but if I had not known that fact before, I would have perceived all of “Philips”,
“Philip” and “Philipss2” as equally meaningless, arbitrary sets of bytes. Since “Philips” is a registered
trademark, I know that it is a data module but “Philip” and “Philipss2” are not.
Data batch: A set of data modules, whose elements have been associated with a single,
particular extity_called the core of the batch_ according to a particular context… Many batches can be
defined for the same core. New dynamic tags are defined for the elements of a batch during the
association so as modules in the batch become dynamic tags for the core as a result of the association.
Therefore, each data batch is a subset of its core’s tag. For example, the following are three data
modules with an arbitrary dynamic tag for each:
“Philips” “name of an electronic company”
“202E” “an arbitrary alphanumeric code”
“245” “an abstract numerical concept”
Now, as the batch B1={Philips, 202E, 245} is created (Quotation marks have been omitted for
convenience.)through association with the extity “a monitor”, new dynamic tags become:
“Philips” “brand name”
“202E” “model code”
“245” “price in TL”
The core in question is C1, an extity class (not a particular object such as “the monitor used by
Bill Gates to create Windows OS in his garage”), whose dynamic tag is “Philips monitor of model
202E that costs 245”. As can be seen, the batch is a subset of C1’s tag.
But “a Philips monitor of model 202E that costs 245” (Rev:wrong! The correct form : data
module: “Philips 202E” , and its tag: “brand and model of a monitor that costs 245”
Tags of elements of the batch change through that association towards defining the core. That
is, inclusion of a module in the batch is not arbitrary but the module is a subset of the core’s tag.
A group of data modules that pertain to the same extity can be included in a set called data
batch. Elements of the batch are not arbitrary modules relevant to the same extity only, but they are
included in the batch according to a particular context. Moreover, including modules in the batch that
are not necessarily relevant make them pertinent to the extity in question by associating them. For
instance, regarding the data batch “Philips, 202E, 245”, “Philips”, “202E” and “245” represent name
of a company, an arbitrary alphanumeric code and name of an abstract numerical concept,
respectively. Associating them with a monitor_an extity_ gives each of those modules a different
meaning defined in a context, namely, brand, model and price of the monitor. The monitor in question
here is actually name of a class of objects. A monitor, for instance, with the tag “used by Bill Gates to
create Windows OS in his garage” makes it unique. This also shows that an extity may have
theoretically countless different tags.
“Philips” always pertains to a single extity (namely a company), whereas “245” is a number
that may represent practically anything. Including it in the batch creates an association
“Philips 202E 245” is a data batch, whose elements all pertain to a monitor (an extity).
Moreover, the data module “Philips” represents a single, particular piece of information and so always
pertains to a single extity (namely a company), whereas “245” pertains to the monitor just because it
has been a part of this particular batch. In another batch like “my bank accout 245 TL”, “ 245”
corresponds to my wealth deposited in a bank.
The batch “Philips 202E 245” is more meaningful than the data module “245” because it
represents more information, and more importantly clearer information than mere “245”. Now it states
the brand, model and price of a monitor. Nevertheless, the definition and degree of meaningfullness
depends on a purpose/subject/function/task/context and thus relative. In this regard, the example batch
may still be meaningless.
Each data module in the batch is a “field” of the batch. Some of the fields can even be large
information modules. (Rev: I may use the “field” for something else, such a s a phrase.)
Data sequence: A set of data modules whose elements are in a particular order. A sentence is
a data sequence whose elements corresponds to extities of different natures.
Information module: A set of ordered or unordered data modules with at least one data batch
subset, whose elements all pertain to a single extity…
Data pair: A data module is actually meaningless by itself unless it is associated with a
definition/description. For example, no one can know what “245” is, until it is associated with the
definition “Price of a Philips 202E monitor at Gold Computers”. Hence, (“Price of a Philips 202E
monitor at Gold Computers”, 245) is a meaningful pair. The definition term itself is an information
module.
Data set: A set of data modules, each of which pertains to an extity that is of the same
class/type. The difference from the data batch is that components of the batch pertain to the same
extity, whereas elements of a data set pertain to different extities although those extities are of the
same class/type.
Data batch: A data module may be still insignificant by itself with regards to a
purpose/subject/function/task/context, even though it is in data pair format.
It has to be connected with other modules to create a group so that the group becomes
significant. For instance, just a list of some TV cards (data set A) is significant only if the purpose is to
get a rough idea about how diversified the product range of TV cards. Likewise, a list of prices only
(data set B) serves the purpose of getting an idea about TV card prices. The group, which is called data
batch, created by associating one product from set A and its corresponding price from set B is more
significant because it presents price information of a product. However, it is still insignificant for
making purchase decisions. In that case, the appropriate batch must contain price, place, compatibility
and features data (4 data fields).
Data batch set: It is technically a data set whose elements are data batches, so batch
definition applies. As a matter of fact, a purchase decision mentioned above requires a data batch set
with batches that include fields for price, place, compatibility, features data, and optionally a field for
recommandation status. Each element of a data batch set is called an “entry”.
Unit information module: A sequence of data modules and possibly other elements that are
not data modules, which is structured around a main topic/thought and significant by itself, i.e., it is
compact and it makes a point or presents a piece of information. The sequence is characterized by
grammar rules, i.e., a unit information module is usually a sentence or part of a sentence. Yet,
significant sequences without a sentence format can also be created. In that case, difference between a
unit information module and a data batch diminishes. The difference from a data batch is that the
components of a uim do not have to pertain to the same extity, although the uim is structured around a
main topic. In fact, it is one of the characteristics of uim, it points to a diversified set of extities. A
specific extity may be the main topic, but even in that case, a uim employs…
Information module: Union of unit information modules such that all uim pertain to a
common topic or extity, “thing referes to any object, event, concept, etc. that exists in real or mental
plane.