![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
which would mean that you must have the EE to do development or learning)? |
|
This seems like a killer to me, because it means that underlying relational (or whatever) data must be designed in anticipation of pulling the data into a cube, which would be fundamentally in opposition to the whole point of data mining. |
#3
| |||
| |||
|
|
As I understand it, there may only be a single data source per cube. Is there something I am missing here (I'm not clear about how multiple partitions would affect this since the personal edition of AS does not allow it, which would mean that you must have the EE to do development or learning)? This seems like a killer to me, because it means that underlying relational (or whatever) data must be designed in anticipation of pulling the data into a cube, which would be fundamentally in opposition to the whole point of data mining. I'm aware that there are ridiculous workarounds such as creating views for all objects within the target database/datasource, but it takes something fundamental to MS SQL Server- multiple databases on a single server that are accessible by each other- and throws it out the window. What's up with that? |
#4
| |||||||
| |||||||
|
|
its always recommended to use views to populate AS objects (cubes, dimensions) these views allows you to insure that you provide the right information to the cube regarding the databases structure changes. also, in AS, a cube is created using 1 fact table only and the dimensions must come from the same database. Without 1 point of access, how can you insure that the product id 1 is the same in all databases? |
|
I don't understand why you found this solution is an opposite of a dataming solution. |
|
A good datamining solution must be done using good & clean data. |
|
Also you must be able to create test data to validate your datamining model. If you have multiple databases & multiple sources, creating a good model starts to become a real problem for your analysts. |
|
what you found a limitation is exactly what a datawarewhouse must be: having 1 point of access for the information with cleansed & easy to use data, for data mining, olap analysis or reporting. to insure that all the users access the same information with good performance & shared metadata. |
|
Also, I have a question for you, you have multiple databases on the same server... does this mean that your databases have the same schema & the same information? |
|
why you don't merge your databases? |
#5
| |||
| |||
|
|
AS2k5 lifts this restriction and allows you to pull data from multiple datasources into the one cube. |
|
Although in AS2k, it's not as silly a restriction as it may seem. Most of the time when you are dealing with multiple datasources, they are not all consistent and you need to go through some sort of transformation step after which you load the cleansed data into a central data warehouse. |
|
In other situations people do not want to query the source systems directly from AS as this could affect performance. |
#6
| |||
| |||
|
|
Yes, but this does not mean that it must always be in the same database. |
|
But what if the source systems are just two different databases dedicated to OLAP, let alone two different DBMSs? |
![]() |
| Thread Tools | |
| Display Modes | |
| |