Wednesday, September 5, 2012

Day 3 - Team B - Aparna Vinod



OLAP Cubes in SPSS


This morning in class we were taught a new data analysis technique called 'OLAP Cubes'. This tool adds a third dimension, depth, to the regular table, which has just two dimensions: height and width. An OLAP (Online Analytical Processing) cube is the output of a procedure that summarizes one or more scale variables within the categories of one or more categorical variables; the categorical variables divide the report into the layers that form the depth.



What is OLAP?


OLAP is a buzzword today. The acronym stands for On-Line Analytical Processing, although the term does not describe very well what OLAP tools actually do. OLAP tools allow the user to query, browse, and summarize information in an efficient, interactive, and dynamic way. They are a vital component of both business intelligence and data mining technology, and they provide an aggregated approach to analyzing large amounts of detailed data.


Why cubes?


OLAP databases are often referred to as "cubes" because of their multidimensional nature. Each result of querying, browsing, and summarizing can be viewed and stored as a separate cube. A cube is a visual representation of a multidimensional table and has just three dimensions: rows, columns, and layers. OLAP cubes are very flexible because they allow the user to move information between these three dimensions, and they are easy to create and manipulate. Since they provide insight into various aspects of the data, these tools also count as data mining technology. Users can have multiple cubes for their business data: one cube for customers, one for sales, one for production, one for geography, etc.
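
To make the idea of moving information between dimensions a little more concrete, here is a minimal Python sketch. Python is not part of the SPSS procedure, and the records and the names `build_cube`, `layer` and `swap_dimensions` are all made up for illustration. It stores one summed value per (gender, provider) cell and shows that the same cube can be sliced by either variable.

```python
from collections import defaultdict

# Hypothetical records: (gender, provider, monthly bill). Not the class data.
records = [
    ("Female", "BSNL", 250.0),
    ("Female", "Hutch", 400.0),
    ("Male", "BSNL", 300.0),
    ("Male", "Hutch", 350.0),
    ("Male", "Hutch", 330.0),
]

def build_cube(rows):
    """Sum the bill for every (gender, provider) cell."""
    cube = defaultdict(float)
    for gender, provider, bill in rows:
        cube[(gender, provider)] += bill
    return dict(cube)

def layer(cube, key):
    """One layer: fix the first dimension at `key`, keep the second."""
    return {second: total for (first, second), total in cube.items() if first == key}

def swap_dimensions(cube):
    """Exchange the two dimensions, so the second one now defines the layers."""
    return {(second, first): total for (first, second), total in cube.items()}

cube = build_cube(records)
male_layer = layer(cube, "Male")            # providers within the Male layer
by_provider = swap_dimensions(cube)
hutch_layer = layer(by_provider, "Hutch")   # genders within the Hutch layer
```

The cell values never change; only the choice of which variable defines the layer does, which is what pivoting a cube amounts to.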


Types of Variables in OLAP Cubes


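1. Summary Variables: the scale variables that are summarized (for example, monthly expenditure on phone).
2. Grouping Variables: the categorical variables that define the layers (for example, gender of respondent).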
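
The split between the two kinds of variables can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the rows, the field names and the `summarize` helper are hypothetical, and the standard deviation is taken as the sample standard deviation (dividing by n - 1).

```python
import statistics

# Hypothetical survey rows; not the Cell_Inter.sav data.
rows = [
    {"gender": "Female", "monthly_bill": 200.0},
    {"gender": "Female", "monthly_bill": 400.0},
    {"gender": "Male", "monthly_bill": 300.0},
    {"gender": "Male", "monthly_bill": 350.0},
    {"gender": "Male", "monthly_bill": 400.0},
]

def summarize(rows, summary_var, grouping_var):
    """Summarize one scale variable within each category of a grouping variable."""
    groups = {}
    for row in rows:
        groups.setdefault(row[grouping_var], []).append(row[summary_var])
    return {
        category: {
            "Sum": sum(values),
            "N": len(values),
            "Mean": statistics.mean(values),
            # The sample standard deviation needs at least two values.
            "Std. Deviation": statistics.stdev(values) if len(values) > 1 else None,
        }
        for category, values in groups.items()
    }

report = summarize(rows, summary_var="monthly_bill", grouping_var="gender")
```

Here `monthly_bill` plays the role of a summary variable and `gender` the role of a grouping variable; each key of `report` corresponds to one layer.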


Steps in Producing a Three-Dimensional Table


1. Choose File --> Open --> Data and open Cell_Inter.sav.

2. Choose Analyze --> Reports --> OLAP Cubes.
    The OLAP Cubes dialog box appears.



3. In the list on the left of the OLAP Cubes dialog box:
        a. Select the relevant variables and move them to the Summary Variables panel.
        b. Select the relevant variables and move them to the Grouping Variables panel.



4. Click the Statistics button.

The OLAP Cubes Statistics dialog box appears, where you can decide which calculations you want SPSS to perform.

5. Click the OK button.

The Total layer of the multi-layered table will then appear. Double-clicking the OLAP Cubes table selects it and displays a pull-down list for each grouping variable.


By making selections from these lists, you can change which layer of the table is displayed on top.
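
What the pull-down lists do can be imitated with a small filter. The following is a rough Python sketch, not SPSS itself: the rows, the field names and the `select_layer` and `layer_sum` helpers are hypothetical, and the value "Total" is assumed to mean "do not filter on this variable".

```python
# Hypothetical survey rows; the field names and values are made up for illustration.
rows = [
    {"gender": "Female", "provider": "BSNL", "bill": 250.0},
    {"gender": "Female", "provider": "Hutch", "bill": 400.0},
    {"gender": "Male", "provider": "BSNL", "bill": 300.0},
    {"gender": "Male", "provider": "Hutch", "bill": 350.0},
    {"gender": "Male", "provider": "Hutch", "bill": 330.0},
]

def select_layer(rows, **choices):
    """Return the rows behind one layer.

    Each keyword names a grouping variable. The value "Total" leaves that
    variable unfiltered, like choosing Total in a pull-down list.
    """
    return [
        row for row in rows
        if all(value == "Total" or row[name] == value
               for name, value in choices.items())
    ]

def layer_sum(rows, **choices):
    """Sum of the bill over the rows that belong to the chosen layer."""
    return sum(row["bill"] for row in select_layer(rows, **choices))

total = layer_sum(rows, gender="Total", provider="Total")
male = layer_sum(rows, gender="Male", provider="Total")
male_hutch = layer_sum(rows, gender="Male", provider="Hutch")
```

Each combination of choices picks out a different subset of respondents, which is why the statistics shown on top change as soon as a selection changes.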


A Few Examples of Different Layers


A. Gender of respondent: Female
   Level of education: Total
   Name of current service provider: Total

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  11240.00  30   374.6667  320.17520       15.5%           14.6%
Fixed component of bill       1448.00   30   48.2667   13.22206        14.6%           14.6%
Voice calls bill              1210.00   30   40.3333   23.48636        12.1%           14.6%
SMS bill                      703.00    30   23.4333   11.91256        12.7%           14.6%
Other charges                 200.00    30   6.6667    12.34094        17.4%           14.6%



B. Gender of respondent: Male
   Level of education: Total
   Name of current service provider: Total

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  61393.00  176  348.8239  151.16696       84.5%           85.4%
Fixed component of bill       8466.00   176  48.1023   20.51733        85.4%           85.4%
Voice calls bill              8775.00   176  49.8580   29.47846        87.9%           85.4%
SMS bill                      4816.00   176  27.3636   18.40819        87.3%           85.4%
Other charges                 947.00    176  5.3807    11.00844        82.6%           85.4%



C. Gender of respondent: Male
   Level of education: Total
   Name of current service provider: BSNL

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  6325.00   20   316.2500  95.00963        8.7%            9.7%
Fixed component of bill       960.00    20   48.0000   24.02849        9.7%            9.7%
Voice calls bill              885.00    20   44.2500   25.04076        8.9%            9.7%
SMS bill                      665.00    20   33.2500   20.98088        12.0%           9.7%
Other charges                 120.00    20   6.0000    11.87656        10.5%           9.7%

                       
                                                                              
D. Gender of respondent: Male
   Level of education: Total
   Name of current service provider: Hutch

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  22127.00  66   335.2576  123.93042       30.5%           32.0%
Fixed component of bill       3114.00   66   47.1818   19.88344        31.4%           32.0%
Voice calls bill              3200.00   66   48.4848   24.69771        32.0%           32.0%
SMS bill                      1656.00   66   25.0909   16.90856        30.0%           32.0%
Other charges                 216.00    66   3.2727    8.26958         18.8%           32.0%

                                                                                                        

E. Gender of respondent: Female
   Level of education: Total
   Name of current service provider: BSNL

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  1283.00   5    256.6000  104.13117       1.8%            2.4%
Fixed component of bill       237.00    5    47.4000   17.85497        2.4%            2.4%
Voice calls bill              265.00    5    53.0000   38.01316        2.7%            2.4%
SMS bill                      85.00     5    17.0000   14.83240        1.5%            2.4%
Other charges                 .00       5    .0000     .00000          .0%             2.4%

                                                                                                        

F. Gender of respondent: Female
   Level of education: Total
   Name of current service provider: Hutch

                              Sum       N    Mean      Std. Deviation  % of Total Sum  % of Total N
Monthly expenditure on phone  3502.00   9    389.1111  97.57106        4.8%            4.4%
Fixed component of bill       404.00    9    44.8889   8.70983         4.1%            4.4%
Voice calls bill              400.00    9    44.4444   22.97341        4.0%            4.4%
SMS bill                      210.00    9    23.3333   11.98958        3.8%            4.4%
Other charges                 65.00     9    7.2222    10.92906        5.7%            4.4%
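
As a quick sanity check on how the percentage columns relate to the sums and counts, the figures for "Monthly expenditure on phone" in layers A and B can be recomputed by hand. The Python sketch below copies those numbers from the tables; it assumes that the Total layer is simply Female plus Male (no respondents with missing gender), and the `pct` helper is just an illustrative name.

```python
# "Monthly expenditure on phone", copied from layer A (Female) and layer B (Male).
female_sum, female_n = 11240.00, 30
male_sum, male_n = 61393.00, 176

# Assumption: every respondent is either Female or Male, so the totals add up.
total_sum = female_sum + male_sum
total_n = female_n + male_n

def pct(part, whole):
    """Share of `part` in `whole`, as a percentage rounded to one decimal."""
    return round(100 * part / whole, 1)

female_pct_sum = pct(female_sum, total_sum)   # compare with % of Total Sum in layer A
female_pct_n = pct(female_n, total_n)         # compare with % of Total N in layer A
male_pct_sum = pct(male_sum, total_sum)       # compare with % of Total Sum in layer B
male_pct_n = pct(male_n, total_n)             # compare with % of Total N in layer B

# The Mean column is Sum divided by N.
female_mean = round(female_sum / female_n, 4)
male_mean = round(male_sum / male_n, 4)
```

Under that assumption the recomputed values agree with the reported ones: 15.5% and 14.6% for layer A, 84.5% and 85.4% for layer B, with means of 374.6667 and 348.8239.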



By analysing the three dimensions of available data using OLAP Cubes, one can have a multi-faceted view of consumer trends and other patterns.














