views:

5227

answers:

8

Hello people!

My company is heavily invested in the MS BI Stack (SQL Server Reporting Services, -Analysis Services and -Integration Services), but I want to have a look at what the seemingly most talked about open-source alternative Pentaho is like.

I've installed a version, and I got it up and running quite painlessly. So that's good. But I haven't really the time to start using it for actual work to get a thorough understanding of the package.

Have any of you got any insights into what are the pros and cons of Pentaho vs MS BI, or any links to such comparisons?

Much appreciated!

+2  A: 

I can't offer any input on the MS BI Stack but at the most recent Barcamp Orlando, the folks from Pentaho were there and spoke about their products and it was an extremely impressive demo.

The fact that it's an Open Source project that you can extend yourself as well as a paid package for really good service leaves you with a lot of options. They demonstrated some paid work they did for a client and they definitely wow'd the crowd.

I also had a chance to chat a little bit with a developer working on the data warehousing side of things for Pentaho and he was extremely sharp and was very open to suggestions and had no problems answering any questions.

So as far as a company goes, Pentaho really impressed me with both their work and how friendly and approachable all of their developers were.

mwilliams
+18  A: 

I reviewed multiple Bi stacks while on a path to get off of Business Objects. A lot of my comments are preference. Both tool sets are excellent. Some things are how I prefer chocolate fudge brownie ice cream over plain chocolate.

Pentaho has some really smart guys working with them but Microsoft has been on a well funded and well planned path. Keep in mind MS are still the underdogs in the database market. Oracle is king here. To be competitive MS has been giving away a lot of goodies when you buy the database and have been forced to reinvent their platform a couple of times. I know this is not about the database, but the DB battle has cause MS to give away a lot in order to add value to their stack.

1.) Platform
SQL server doesn't run on Unix or Linux so they are automatically excluded from this market. Windows is about the same price as some versions or Unix now. Windows is pretty cheap and runs faily well now. It gives me about as much trouble as Linux.

2.) OLAP
Analysis services was reinvented in 2005 (current is 2008) over the 2000 version. It is an order of magnatude more powerful over 2000. The pentaho (Mondrian) is not as fast once you get big. It also has few features. It is pretty good but there are less in the way of tools. Both support Excel as the platform which is esscential. The MS version is more robust.

3.) ETL
MS - DTS has been replaced with SSIS. Again, order of magnatude increase in speed, power, and ability. It controls any and all data movement or program control. If it can't do it you can write a script in Powershell. On par with Informatica in the 2008 release. Pentaho - Much better than is used to be. Not as fast as I would like but I can do just about everything I want to do.

4.) dashboard
Pentaho has improved this. It is sort of uncomfortable and unfriendly to develop but there is really not a real equiv for MS.

5.) reports
MS reports is really powerful but not all that hard to use. I like it now but hated it at first, until I got to know it a little better. I had been using crystal reports and the MS report builder is much more powerful. It is easy to do hard things in MS, but a little harder to do easy things. Pentaho is a little clumsy. I didn't like it at all but you might. I found it to be overly complex. I wish it was either more like the Crystal report builder or the MS report builder but it is jasper like. I find is to be hard. That may be a preference.

6.) ad hoc
MS - this was the real winner for me. I tested it with my users an they instantly in love with the MS user report builder. What made the difference was how it was not just easy to use, but also productive. Pentaho - is good but pretty old school. It uses the more typical wizard based model and has powerful tools but I hate it. It is an excellent tool for what it is, but we have moved on from this style and no one wants to go back. Same problem I had with logiXML. The interface worked well for what it was but is not really much of a change from what we used 12 years. http://wiki.pentaho.com/display/PRESALESPORTAL/Methods+of+Interactive+Reporting

There are some experienced people out there that can make Pentaho really run well, I just found the MS suite to be more productive.

Stradas
Excellent and well written, thankyou!
Tomas
Stradas, you are providing great information to all the community. Well writen, your warnings about the things that might be inexact due your own preferences and the organization of the content provide a great reference point. Thanks for taking the time for this.
vmarquez
NagaMensh, can you be more specific? The learning curve is a little long compared to DTS but I find SSIS to be a well thought out tool. Informatica is the closest comparison. There are other good tools that are also easy to use but most of them are not as robust.
Stradas
A: 

If you are looking for a robust, low cost alternative to the big boys LogiXML has dashboarding and ad hoc reporting on a .NET platform. We've been using them since late 2006 when Pentaho was just starting, but I haven't looked at it in awhile.

+1  A: 

I started using MS Reporting Services many years ago and just love it. I've not tried Penaho's reporting solution so I can't comment on it. Nor have I tried either Analysis Services or Pentaho's alternative.

Recently I needed an ETL solution and being familiar with MSSQL and MSRS it seemed obvious that I would review and probably choose MS Integration Service. But for me, MSIS was awful. Mostly because it was not intuitive. After spending a couple of days trying to learn the tool I decided to look for an alternative and came across Pentaho Data Integration, formerly known as Kettle. I had it up and running within minutes and immediately created my first transformation. It just works.

Admittedly my needs are fairly simple but performance has been great and the community seems very helpful.

Gary B
+1  A: 

Warning -- there are numerous sites out there listing the numerous deficiencies, bugs, and annoyances with SSIS. Not sure why SSIS came out on top with the post -- but before you bet your project on it, look at what people have to say in the blogosphere. From my experience its about 20:1 ranting about how horrible SSIS is to work with--I can concur as well, currently looking for any alternative.

alchemical
SSIS has a very good (fast/powerful) engine, but has one of the shittiest designers I have worked with in my life. There are countless petty annoyances e.g. it has a RIGHT() function, but no LEFT()!
adolf garlic
A: 

Hi,

I was wondering what ETL tool you would recommend as middleware, ETL, to create data transformation between multiple relational databases, flat and other type of files. Ideally, we are looking for windows based software which is flexible enough to work with different databases and platforms should we switch to a different rdbms. I have used dts in the past and looked at Kettle demo. It seems like they do have similar. Can we use dts, if one of the databases is mysql? Are there any other prefered, easy to use packages?

Thanks,

Nore
A: 

Agree about SSIS. It's a nightmare.

jay
A: 

just recently tried pentaho open source BI, it was extremely clumsy

not very initiative at all!

development time took much longer

quite different from either Oracle or ms BI solutions thumb down, maybe the enterprise edition is better

q Tran