![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| |||
| |||
|
#2
| |||
| |||
|
|
I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#3
| |||
| |||
|
|
I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#4
| |||
| |||
|
|
Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#5
| |||
| |||
|
|
Thank you very much for the info. My final step is to implement a changing dimension (not sure if I should go shared or private) on the cube to allow me to process specific partitions based on changes to the dimension. I have read that member keys must be unique. It will have a Parent(Class) and a child(Student) level. Does this mean I cannot have Students in more than one class? Can anyone provide me with any links providing more detailed descriptions and implementations of changing dimensions? I am having some difficulty understanding how changing dimensions work and require processing(the cube and dimension). thanks again, tim "Dave Wickert [MSFT]" <dwickert (AT) online (DOT) microsoft.com> wrote in message news:%232$ZbingFHA.3540 (AT) TK2MSFTNGP14 (DOT) phx.gbl... Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#6
| |||
| |||
|
|
Thank you very much for the info. My final step is to implement a changing dimension (not sure if I should go shared or private) on the cube to allow me to process specific partitions based on changes to the dimension. I have read that member keys must be unique. It will have a Parent(Class) and a child(Student) level. Does this mean I cannot have Students in more than one class? Can anyone provide me with any links providing more detailed descriptions and implementations of changing dimensions? I am having some difficulty understanding how changing dimensions work and require processing(the cube and dimension). thanks again, tim "Dave Wickert [MSFT]" <dwickert (AT) online (DOT) microsoft.com> wrote in message news:%232$ZbingFHA.3540 (AT) TK2MSFTNGP14 (DOT) phx.gbl... Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#7
| |||
| |||
|
#8
| |||
| |||
|
|
in a changing dimension the lowest level must have a unique key. (the student key in your case) And yes, the student is in only 1 class in a changing dimension. If you want to have multiple classes by student you have to create a new student (1 student by class) but not so good because you'll count twice the same student. or you have to create 2 separate dimensions 1 for classes 1 for students "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:%23GgfmvygFHA.2904 (AT) tk2msftngp13 (DOT) phx.gbl... Thank you very much for the info. My final step is to implement a changing dimension (not sure if I should go shared or private) on the cube to allow me to process specific partitions based on changes to the dimension. I have read that member keys must be unique. It will have a Parent(Class) and a child(Student) level. Does this mean I cannot have Students in more than one class? Can anyone provide me with any links providing more detailed descriptions and implementations of changing dimensions? I am having some difficulty understanding how changing dimensions work and require processing(the cube and dimension). thanks again, tim "Dave Wickert [MSFT]" <dwickert (AT) online (DOT) microsoft.com> wrote in message news:%232$ZbingFHA.3540 (AT) TK2MSFTNGP14 (DOT) phx.gbl... Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#9
| |||
| |||
|
|
Exensive discussion in the AS Performance Guide http://www.microsoft.com/technet/pro.../ansvcspg.mspx And some minor ones in the AS Operations Guide http://www.microsoft.com/technet/pro.../anservog.mspx -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:%23GgfmvygFHA.2904 (AT) tk2msftngp13 (DOT) phx.gbl... Thank you very much for the info. My final step is to implement a changing dimension (not sure if I should go shared or private) on the cube to allow me to process specific partitions based on changes to the dimension. I have read that member keys must be unique. It will have a Parent(Class) and a child(Student) level. Does this mean I cannot have Students in more than one class? Can anyone provide me with any links providing more detailed descriptions and implementations of changing dimensions? I am having some difficulty understanding how changing dimensions work and require processing(the cube and dimension). thanks again, tim "Dave Wickert [MSFT]" <dwickert (AT) online (DOT) microsoft.com> wrote in message news:%232$ZbingFHA.3540 (AT) TK2MSFTNGP14 (DOT) phx.gbl... Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
#10
| |||
| |||
|
|
That totally kills me. I have to be able to report on this data in real time, AND I must be able to put a student in multiple classes. I have over 64,000 students in the system, therefore I cannot create a 'Student' Dimension because it breaks the 64,000 child limit. It is really beginning to look like I may need to create District specific Cubes. I will keep prototyping and let you all know what I do. Any other ideas out there? Thanks for all the input "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:uov1zG1gFHA.2444 (AT) tk2msftngp13 (DOT) phx.gbl... in a changing dimension the lowest level must have a unique key. (the student key in your case) And yes, the student is in only 1 class in a changing dimension. If you want to have multiple classes by student you have to create a new student (1 student by class) but not so good because you'll count twice the same student. or you have to create 2 separate dimensions 1 for classes 1 for students "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:%23GgfmvygFHA.2904 (AT) tk2msftngp13 (DOT) phx.gbl... Thank you very much for the info. My final step is to implement a changing dimension (not sure if I should go shared or private) on the cube to allow me to process specific partitions based on changes to the dimension. I have read that member keys must be unique. It will have a Parent(Class) and a child(Student) level. Does this mean I cannot have Students in more than one class? Can anyone provide me with any links providing more detailed descriptions and implementations of changing dimensions? I am having some difficulty understanding how changing dimensions work and require processing(the cube and dimension). thanks again, tim "Dave Wickert [MSFT]" <dwickert (AT) online (DOT) microsoft.com> wrote in message news:%232$ZbingFHA.3540 (AT) TK2MSFTNGP14 (DOT) phx.gbl... Absolutely agree. Use partitioning. There is a discussion of partitioning in the SQL Server 2000 Resource Kit book, and here: http://www.microsoft.com/resources/d...rt6/c2661.mspx (in the section titled Partition Strategy) And here: (AS Performance Guide) http://www.microsoft.com/technet/pro.../ansvcspg.mspx (all over the place) And here: (AS Operations Guide) http://www.microsoft.com/technet/pro.../anservog.mspx in the section titled: Performance Configuration Issues (among other sections) -- Dave Wickert [MSFT] dwickert (AT) online (DOT) microsoft.com Program Manager BI SystemsTeam SQL BI Product Unit (Analysis Services) -- This posting is provided "AS IS" with no warranties, and confers no rights. "Jéjé" <willgart_A_ (AT) hotmail_A_ (DOT) com> wrote in message news:Op7NnMlgFHA.2472 (AT) TK2MSFTNGP15 (DOT) phx.gbl... I recommand parititions! the work required to "merge" these cubes into 1 virtual cube and create calculated measures which aggregate the 450 measures of the 450 cubes is big. also, with cubes you have 450 cubes to manage. 1 change = 450 changes !!! also, with a virtual cube in top of your cubes, you can have a problem if 1 or more cubes is not processed. For example, if you change 1 dimension (or you do a full process of 1 dimension), then you have to reprocess all cubes before the virtual cube can be used. With the partition option you must reprocess all partitions, but after the creation of the first partition, the cube is accessible to the end user. (Well only 1 district accessible at this time, but its better then nothing) in another way, if you have no plan or no measure across these 450 districts ("all districts" not available and no district comparison needed) then you can have 450 cubes whitout any virtual cube and each user must select the cube to use. "Tim Chapla" <tchapla (AT) nospam (DOT) turnleaf.com> wrote in message news:OlxB8ElgFHA.1148 (AT) TK2MSFTNGP12 (DOT) phx.gbl... I am looking for someone who can give some advice on creating partitions/cubes. Here is my situation: I have a fact table with 80 million rows (results for kids in a state). I create a giant cube with all the results and it takes too long to process. (I have the need to process this data a couple times a day) I am researching creating several cubes (450, one per district) VS creating several partitions (450, one per district) for the same cube. It is my understanding that I can process partitions separate from one another which would allow me to do more of a selective update on the cube. Anyone out there have any advice or resources they could point me to which would assist me in making this decision? tx, tim |
![]() |
| Thread Tools | |
| Display Modes | |
| |