![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
|Hi, | |I have to incorporate in a cube data coming from different sources. This |data is a flat text file, the result of a an sql query with multiple joins. |"Parent" data is being repeated over and over, for each "Child" data. | |I intend to store this file in a single fact table, and use the same fact |table to create the different dimensions. A short test has shown me it |works. | |I understand that this way of storing data is not optimal: an foreign key is |certainly smaller than anything else. But this way, I can avoid recreating |an sql structure, by splitting data into a pure fact table, and dimension |tables, with keys and joins inbetween. | |In this context, does it make sense to use one single fact table, and no |dimension table, in order to build an olap cube? Is the resulting OLAP cube |supposed to be correct, even if similar dimensions rows are being repeated |several times? | |Thanks | |Philippe Lang | | | |
#3
| |||
| |||
|
|
Hi Philippe, Welcome to the jungle. A few weeks back (26-Mar-04 to be precise) I started out with a pretty similar question and am sharing the following based on the replies and suggestions that I got from so many helpful folks here. Before I go on, I would suggest that DO try to go through the various posts and these are EXCELLENT learning ground. I also started out using a single table as the fact as well basing the dimensions on it...u know what...I was getting results too but never checked them because I learned that my approach was wrong. 1) Go through posts with the subject "ETL"... you need to do some transformations which will extract the tables for your underlying dimensions. I'm told that the best approach is to base your fact and dimensions on views rather than tables. I am doing this but to be honest am not clear about the advantage... Well what do you expect from a learner ![]() 2) Make sure that you use data types which occupy small 'space'... treat this as a watch out factor 3) Make surrogate keys... Again something that I have learned from here and these is supposed to be the 'right approach' 4) In summary do not use a single table for your fact as well as dimensions. You might be getting the results (and maybe even right results) but you will run into problems soon. What problems? This is where the gurus will come in... I hope I have ben able to reply to at least some of your questions. I really hope so because this is my FIRST reply where I feel that I am sharing whatever I have gained from here and am trying to help out another "Learner" Sorry for the lengthy post. Good Luck. P.S. Log in the hours and the efforts and you'll be excited with where you stand in just a few weeks. Oh yes,,,,, and then you will also be running into a front-end OLAP tool selection. This part I will not comment on YET and am in the evaluation process myself..have come across some neat tools ![]() |Hi, | |I have to incorporate in a cube data coming from different sources. This |data is a flat text file, the result of a an sql query with multiple joins. |"Parent" data is being repeated over and over, for each "Child" data. | |I intend to store this file in a single fact table, and use the same fact |table to create the different dimensions. A short test has shown me it |works. | |I understand that this way of storing data is not optimal: an foreign key is |certainly smaller than anything else. But this way, I can avoid recreating |an sql structure, by splitting data into a pure fact table, and dimension |tables, with keys and joins inbetween. | |In this context, does it make sense to use one single fact table, and no |dimension table, in order to build an olap cube? Is the resulting OLAP cube |supposed to be correct, even if similar dimensions rows are being repeated |several times? | |Thanks | |Philippe Lang | | | -- -- Thanks. |
#4
| |||
| |||
|
#5
| |||
| |||
|
|
Hi, I don't want to misguide you since I'm a starter myself. Please allow me a little time to get back to you with useful replies to your actual questions. Till then However please see below for an initial suggestions: Look out for replies by the following people. However I must clarify that there are MANY knowledgeable folks over here (microsoft.public.sqlserver.olap)and I am only listing some of the names whose replies have REALLY helped me (or whose replies to someone else's questions have added to my knowledge and clarifying my concepts). BTW, the order in they are listed does not represent any thing and Isincerely that hope I will not offend anyone by listing the following/missing out their name: Michael Vardinghus, Andrej Hudoklin, Graeme Scott, Steve Kass, Alejandro Mesa, Tom Chester, Abhinav Kumar, Dave Wickert, Nigel Pendse, Allan Mitchell, Deepak Puri, Andrea Temporin, and many more. Okay enough of the Grammy nomination thank you speech (Ooppss... I forgot to mention my family members). Regards. P.S. I am surprised why none of the folks have replied to your post yet??? Unless they've sent you reply via email. |
![]() |
| Thread Tools | |
| Display Modes | |
| |