Chaos and Correlation
International Journal, March 16, 2008

Artificial Intelligence System for Identification of Social Categories of Natives Based on Astronomical Parameters

Eugene Lutsenko (Russia), Alexander Trunev (Canada)

 

The cognitive simulation of AstroDatabank records by using the Artificial Intelligence System – AIDOS, is reviewed in this paper. The technology of simulation is described and the mostly important results are discussed.  

 

Keywords: Semantic Information Models, astrodatabank, Astronomical and Sociological Databases, Neuron-net Training, Numerical Experiment.

 

Introduction

New method of identification of a birth chat based on system-cognitive analysis and on the advanced information theory [1] was developed recently [2-3]. This method differs from the normal astrological models so that the birth chat is not interpreted, but it is identified by using a number of attributes and categories, by comparing with the astrological database [4-5], which includes a description of the many key events in real life of real persons. As a result of the identification each person receives a customized description contains classes and categories of events, indicating the likelihood of their implementation.  In this research not used any astrological interpretation or any astrological rules. Statistical patterns and the correlation revealed in the data processing of the artificial intelligence system by comparing birth charts and biography. Test examples demonstrate the effectiveness of the system for the recognition of certain classes of entities.

Input Databases

The main source of astrological database prepared for the artificial intelligence system simulation is the original (first version) Lois Rodden's AstroDatabank [4] and AstroDatabank v. 4.0 [5]. These databases contain biography of famous and ordinary people so that all the categories and events of life are classified and ordered.

Data imported from AstroDatabank v. 4.0 were converted into a DBF4 format database. Only 9897 records have been utilized including 5 categories shown below with corresponding number of records:

Table 1: Four classes, 5 categories and related number of records

KOD_OBJ

NAME

ABS

1

Politics, Science

1876

2

Medical: Physician

347

3

Sports

6032

4

Psychological

1642

Note, 184 records are repeated among 9897 since they related to 2, 3 or 4 categories listed above.  Records were cooperated in four classes as shown in Table 1. Every record has 23 active numerical cells consist of coordinates of celestial bodies, Ascendant and Midhaven at the moment of birth and in the place of birth, i.e.:

·        Longitude (degree) of the Sun, the Moon, Mercury, Venus, Mars, Jupiter, Saturn, Uranus, Neptune, Pluto, North Node, Ascendant and Midhaven;

·        Declination (degree) of the Sun, the Moon, Mercury, Venus, Mars, Jupiter, Saturn, Uranus, Neptune, Pluto.

From this database were derived two databases to study a declination effect on the similarity parameter:

1.      Database1 with 23 active numerical cells in every of 9897 records as described above but all Declination parameters were adapted to the longitude interval (0; 360) by using formula: Declination1 = (Declination +30)*6;

2.      Database0 with 23 active numerical cells in every of 9897 records as described above, but all Declination parameters were recalculated as follows: Declination0 = Declination *0, also for all records we put Ascendant= Midhaven =0, therefore only Longitude of the Sun, the Moon, Mercury, Venus, Mars, Jupiter, Saturn, Uranus, Neptune, Pluto and North Node have been utilized in this database.

After this minor adaptation all 23 cells have one scale and format, therefore they could be analyzed in the same manner as well as the declination parameter effect on the simulated outcomes could be studied. 

The data imported from original Lois Rodden's AstroDatabank were converted into the Borland JDataStore format databases. Then, the data were sorted using SQL queries and special functions written in Java.  Only 20007 records related to 1931 categories and events have been utilized in this research. For these records were calculated coordinates of celestial bodies (latitude and longitude in degrees, and the distance in astronomical units).  12 cusps of astrological houses in the Placidus system were calculated for records with the exact time of birth. The ephemeredes following celestial bodies and points were established: the Sun, the Moon, Mercury, Venus, Mars, Jupiter, Saturn, Uranus, Neptune, Pluto, and North Node.  The next step is sorting by category of records.  As result XML tree categories reference database was obtained. Next, the database has been completely exported in Excel and then it converted to the DBF4 format (which accepted by the artificial intelligence system).  Only 23 active numerical cells in every of 20007 records were utilized in this research, i.e.: Longitude (degree) of the Sun, the Moon, Mercury, Venus, Mars, Jupiter, Saturn, Uranus, Neptune, Pluto, North Node, and 12 cusps of the astrological houses (houses in the Placidus system). From this database were derived several databases:

1.      Database A  of 20007 records related to 500 representative categories (category represented in the database at least 26 times);

2.      Database B of 15007 records related to 500 representative categories - training data set;

3.      Database C of 5000 records which are not used in the Database B (but used in the Database A) – recognized data set.

4.      Database D  of 20007 records related to 240 unrepresentative categories (number of records related to category higher than 2 and less than 25) – low frequency limit;

5.      Database E  of 20007 records related to 870 categories (number of records related to any category higher than 2) – mostly complete database;

6.      Database F of 20007 records related to 37 categories (number of records related to any category higher than 1000) – higher frequency limit;

7.      Database F1 of 20007 records related to 100 categories (number of records related to any category higher than 174);

8.      Database G of 20007 records related to 4 categories listed below in Table 2, b. In this database 8150 records are not involved in a simulation.

Table 2: Four classes, four categories and related number of records in a case of Database G

KOD_OBJ

NAME

ABS

1

Famous

3373

2

Medical

2910

3

Sports

4567

4

Psychological

1007

 

 

Note 20007 records are related to the original (first version) Lois Rodden's AstroDatabank [4] and AstroDatabank v. 4.0 [5] as well.  The difference between these databases is that latest version updated with more than 5000 records, and it is a reason why the same category SPORT has different records in Table 1 and 2.   

The Model and the Artificial Intelligence System - AIDOS

As well know there are several ways to decompose Zodiac circle in a process of analyzing a birth chart:

·        day and night houses partition – 2 sectors;

·        Cardinal signs, fixed and mutable signs – 3 multiply connected sectors.

·        squares – 4 sectors;

·        partition based on element of fire, earth, air and water – 4õ3 sectors;

·        zodiac signs – 12 sectors;

·        decants – 36 sectors;

·        terms – 60 sectors;

·        Degree – 360 sectors.

Decomposition combinations such as those listed above seem to resemble algorithms of grid simulation widely used in a modern science, in which condensation of the grid helps improve convergence in solving the task. We utilized this method in order to perform packet recognition of 9897 or 20,007 records exported from AstroDatabank and presented as DBF4 format databases.  In order to do this a solution founded based on data from 172 grids of various dimensions, containing 2, 3, 4, ..., 173 sectors consequently (it is a limit for this task at the moment). Thus the net entropy effect could be established during this simulation with the system of artificial intelligence AIDOS [2]. 

Standard AIDOS package includes 7 subsystems and 85 programmable applications organized in a block structure – see Table 1. Generally speaking it is a neuron-net computer application running under Windows XP in MS-DOS mode, designed with CLIPPER 5.01, Tools-II and BiGraph 3.01, provided the following objectives:

1.      Synthesis and adaptation of the semantic data model.

2.      Identification and forecasting.

3.      Precise analysis of the semantic data model.

Table 3: Generalized Structure of the Universal Cognitive Analytical System AIDOS, v. 12.03.2008

Subsystem

Mode

Function

Operation

1.

Formalization

1. The classification scale and graduation

2. Descriptive scale (and graduation)

3. Graduation descriptive scales (signs)

4. Hierarchical systems levels

1. Levels of classes

2. Levels of signs

5. Software interfaces for importing data

1. Import data from TXT files-standard DOS-text

2. Import data from DBF files (Standard Prof. A. N. Lebedev)

3. Imports from transposed DBF files (Standard Prof. A. N. Lebedev)

4. Generation scales and training set RND model

5. Generation scales and training sample for the numerical study

6. DBF-matrix transposition of baseline data

7.Import data from DBF files (Standard E. Lebedev)

6. Postal Service to INS

1. Exchange grade

2. Exchange of generalized signs

3. Exchange of primary signs

7. Printing questionnaire

2.

Synthesis SIM

1. Writing-adjustment training set

2. Management of training sample

1. Parametric objects for processing job

2. Statistical parameters, hand sample

3. Auto sample of training set

3. Synthesis of semantic data model SIM

1. The calculation of the absolute frequency matrix

2. Excluding artifacts (robust procedure)

3. Calculation information matrix SIM-1 and converting into executable information matrix

4. The calculation of conditional interest distributions SIM-1 and SIM-2

5. Automatic execution regimes 1-2-3-4

6 . Measurement of convergence and stability model

1. Convergence and sustainability SIM

2. Dependence validity of the model training set

7. Calculation information matrix SIM-2 and converting into executable information matrix

4. Postal Service to educational information

3.

Optimizing SIM

1. Formation of the classes orthonormal basis

2. Excluding signs of a low selective force

3. Removing classes and attributes for which insufficient data

4. Divisions on the part of the typical and atypical

5. Generation of associated signs and convert a training sample

4.

Transcribing

1. Writing-adjustment recognizable sample

2. Batch recognition

3. The withdrawal of recognition results

1. Cut: an object - a lot of classes

2. Cut: one class - many sites

4. Postal Service recognizable sample

5. Construction of the functions of influence

6. Decoding combinations recognizable signs in the sample

5.

Typology

1. Typological analysis classes recognition

1. Information (rank) portraits (classes)

2. Classes cluster and constructive analysis

1. The calculation of similarity matrix of classes images

2. Generation of clusters and constructs classes

3. Viewing and printing cluster, and constructs

4. Automatic execution modes: 1,2,3

5. Conclusion 2D semantic networks classes

3. Cognitive charts of classes

2. Typological analysis of the primary signs

1. Information (rank) portraits of signs

2. Signs cluster and constructive analysis

1. The calculation of similarity matrix of signs images

2. Generation of clusters and constructs signs

3. Viewing and printing cluster, and constructs

4. Automatic execution modes: 1,2,3

5. Conclusion 2d semantic networks signs

3. Cognitive signs of charts

6. Semantic-Cognitive analysis of model SIM

1. Estimation of the objects completion reliability

2. The measurement of the adequacy of semantic data model

3. Measuring independence classes and signs

4. Viewing profiles classes and signs

5. Graphic display of non-local neurons

6. Displaying subsets of the neural network

7. Classical and Integral cognitive maps

7.

Service

1. Generation Databases (dumping)

1. All database

2. NSI

1. All databases NSI

2. Classes DB

3. Initial signs DB

4. Generalized signs DB

3. Training sample

4. Recognized sample

5. Statistics Database

2. Reload index all databases

3. Print database absolute frequencies

4. Printing conditional interest distributions Databases SIM-1 and SIM-2

5. Printing information of SIM-1 and SIM-2 Databases

6. Descriptive Intelligent Information Retrieval System

7. Copying the major SIM databases

8. Convert SIM-1 into executable information matrix

9. Convert SIM-2 into executable information matrix

 

 

The cognitive simulation of AstroDatabank records including the neuron-net training and recognition was realized for any grid of fixed dimension N=2, 3, 4…, 173 sectors. Thus there are many models – M2, M3, M4… M173 corresponding to the number of sectors in a given partition of Zodiac. For every model could be established own catalog (they are numbered simply as 002, 003, 004 …) and a copy of the system AIDOS.  To manage the input parameters and outcomes of all models a special system has been designed [3], which would be implemented "collectives decisive rules» i. e., would the ability to automatically generate a number of models that would form one coherent system, which called "multi-model". This system consists of few programmable applications which allow setup any combination of models; run the neuron-net training and recognition for all models,   organize and summarize the results of the identification of the respondents in different models for a set of categories.

Main Results

The technology of simulation described in papers [6-8]. In fact the system AIDOS operates with Object Code like numbers in a left column in Tables 1, 2. Astronomical parameters also have own code called “scale or graduation code”, for instance, in a case of model M3 we have 23 main scales and 69=23*3 graduations; six of them shown below:

 

Code

Name of scale

1

SUNLON-[3]: {0.000, 120.000}

2

SUNLON-[3]: {120.000, 240.000}

3

SUNLON-[3]: {240.000, 360.000}

4

MOONLON-[3]: {0.000, 120.000}

5

MOONLON-[3]: {120.000, 240.000}

6

MOONLON-[3]: {240.000, 360.000}

    

If any record in a training database shows a longitude of the Sun belongs to the interval (0.000, 120.000) then a frequency of the corresponding code 1 increases on a unit. Therefore a frequency of scales in the training database could be calculated and the frequency matrix and the information matrix could be established. For example, in a case of model M2 trained with Database F, a fragment of the frequency matrix and a fragment of the information matrix are shown in Table 4a and 4b consequently:

 

Table 4a: The frequency matrix (fragment) in a case of model M2 trained with Database F (frequency is given in absolute value) [7]

 

Code of

scale

Code of category

1

2

3

4

5

6

7

8

9

10

11

12

1

6744

2623

2281

2201

1671

1477

1378

1271

1222

1201

1230

1208

2

6896

2502

2286

2270

1702

1433

1297

1306

1220

1195

1155

1152

3

6786

2539

2325

2187

1689

1445

1330

1273

1207

1211

1218

1177

4

6854

2586

2242

2284

1684

1465

1345

1304

1235

1185

1167

1183

5

6261

2401

2070

2039

1561

1343

1307

1185

1125

1086

1134

1156

6

7379

2724

2497

2432

1812

1567

1368

1392

1317

1310

1251

1204

7

6907

2688

2332

2274

1735

1510

1422

1301

1263

1193

1232

1263

8

6733

2437

2235

2197

1638

1400

1253

1276

1179

1203

1153

1097

9

7137

2760

2443

2344

1754

1500

1454

1341

1269

1223

1330

1279

10

6503

2365

2124

2127

1619

1410

1221

1236

1173

1173

1055

1081

 

Actually an information counted in the system with 8 decimal places, but in Table 4b it shown with  2 decimal position (*100) only. A positive or negative value of information in a cell ij in Table 4b means that category j has a positive or negative correlation with scale i.

 

Table 4b: The information matrix (fragment) in a case of model M2 trained with Database F (information given in Bit*100) [7]

 

Code of

scale

Code of category

1

2

3

4

5

6

7

8

9

10

11

12

1

3

-1

17

-2

-3

-0

-3

-3

-3

-3

22

-4

2

4

-3

17

-0

-3