A warning about using the SSIS Foreach Loop Container to process files from a folder
The SSIS Foreach Loop Container is frequently used to process files from a specific folder. If the names of the files are not known beforehand, the usual way is to process all the files with specific...
Estimating SQL Server Table Sizes
Originally posted on LichtenBytes: Microsoft gives guidance for calculating the estimated sizes of your tables and indexes at http://msdn.microsoft.com/en-us/library/ms175991.aspx. Performing such...
Web scraping in R using rVest
I am not very conversant with web scraping, but I understand the importance of the technique, given that a lot of very useful data is embedded in HTML pages. Hence I was very excited when I came...
2014 in review
The WordPress.com stats helper monkeys prepared a 2014 annual report for this blog. Here's an excerpt: The concert hall at the Sydney Opera House holds 2,700 people. This blog was viewed about 11,000...
Data Virtualization in Business Intelligence
A long time ago I wrote a blog post where I described three approaches to providing operational reports and compared them against each other. They are 1) traditional DW approach, 2) the abstract views...
Creating a new SSIS package? Have you thought about these things?
Creating a package in SSIS is easy but creating a “good” SSIS package is a different story. As developers, we tend to jump right into building and creating that wonderfully simple package and often...
Knight’s Move Puzzle in TSQL
A colleague at work set us an interesting puzzle. The puzzle is a generic programming puzzle, but we were required to solve it with TSQL. The person who comes up with the fastest solution wins. Here’s...
SQL Server 2016
SQL Server 2016 was announced in summer 2015, and a CTP has been available since then. It comes with some really cool features; JSON support, PolyBase and Temporal Tables are some of my favorites. SSRS has seen...
BI is Dead, Long live BI
Timo Elliot (blog|twitter) recently published a blog post, BI is Dead, which draws on Gartner’s Magic Quadrant report and a detailed report. The main takeaway from the post (which includes references from...
Zen of data modelling in Hadoop
The Zen of Python is a well-known set of tongue-in-cheek guidelines for writing Python code. If you haven’t read it, I would highly recommend reading it here: Zen of Python....