Posted: 12/19/2011
I've just started a new job where Transactional Replication is used as an ETL method for loading data from source databases in the Data Warehouse. The source data is prevented from being loaded into the Data Warehouse immediately by changing the schedule of the Distribution Agent so that it only runs at a certain times.
I would like to replace this process with either SSIS packages or Change Data Capture (CDC), but I am facing resistance from various quarters. I think it's a case of "we've always done it this way" and I'm struggling to convince people to change the ETL process. What are folks' thoughts on the best way to approach this?
Thanks
Lempster
I'm going to play a bit of "Devil's advocate" here and ask, "Why would you want to change it if it IS working?"
Don't get me wrong there are some good reasons to change it, but from the employer's point of view, they are probably going, "Why go through work, testing, verification, etc - when it works the way it is?" - or something along that line.
A few questions about the system:
Looking forward to your answers,
Keith Hyer
Posted: 12/20/2011
Hi Keith,
Thanks for your response. You are right in that it is working, but I believe it could work better - faster, less resource intensive, more flexible - by using one of the technologies I mentioned. To answer your questions:
A lot of transformations take place on the data that is loaded in to the data warehouse via transactional replication; in my opinion it would be better to do that as close to the source as possible, i.e. within one or more OLEDB Source components of an SSIS package.
CDC is not an option currently as the shop is SQL Server 2005, but an upgrade to 2008 R2 is planned for the first quarter of 2012.
Thanks and regards
Posted: 1/11/2012
@Keith,
Any further thoughts?
Regards