Why big data and social media monitoring will be big

I’m convinced that social media and big data are two of the really biggest things within Business Intelligence the next few years.  There are some good reasons for that:

  • Social media has reached a critical mass. Facebook and Twitter would be on the top 5 largest countries in the world list (based on population), if they were countries.
  • A lot of research progress has been made on automated analysing of blog postings and other forms of human created texts.
  • Data Mining is also becoming more and more widely available.
  • To stay competitive, companies must understand more about their customers and the world around them. The findings in social media will predict what you are going to see in your order books and even later in your financial results.
  • Organisations like United Nations have started to automatically monitor news and social media, with the goal of finding out about crisises in real time. See here: http://www.unglobalpulse.org/
  • The growing amounts of data requires increased computing power. However, we cannot any longer rely on hardware improvements to rescue us. In fact, CPU clock speed has stopped increasing 5 years ago. It is the end of the era of “don’t worry about CPU power”. Much like we now have to think about sustainable energy sources and using energy efficiently, we will need to build “sustainable software”.
  • Social media produces huge amounts of data.
  • This means that we have to scale out (use distributed computing) instead of scaling up. We need architectures that can handle very large amounts of parallelism.
  • Platforms for scaling out (distributed computing), like Hadoop, have been released as Open Source. On top of Hadoop has also been built new databases like Cassandra (with good scalability and fault tolerance), HBase (useful for very large tables) and Hive (a scalable data warehouse platform). These can run on cheap hardware and are freely available.
  • The worlds of traditional databases and big data are meeting. Microsoft has announced that they are integrating Hadoop and SQL Server 2012. A first preview will be available during 2012.
  • The cloud fits very well with social media monitoring and big data. The whole architecture behind the cloud is scaling out, which is what big data really is about. Also social media is very efficiently monitored from the cloud since the data is external anyway.

Is big data and social media monitoring only for the biggest companies and organisations? Certainly not. Even as a small organisation, you will want to look into the future instead of the history (like many current BI solutions that focus mainly on economics). With the cloud it will be affordable and available even to the small companies.

Still not convinced about the impact of big data and social media monitoring? Read this article:

http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html?_r=1

 

Leave a Comment

Filed under General, Technical

User group meeting with Thomas Kejser in Malmö on Tuesday 21 February

I’m very happy to announce that we will have Thomas Kejser speaking at the Swedish SQL Server User Group in Malmö on Tuesday 21 February. Thomas works for the SQLCAT team with the most challenging SQL Server and data warehouses in the world. He’s been speaking at the top international conferences, including PASS SQLRally Nordic, and is holder of the world record on ETL processing.

Thomas will be speaking about scalable data warehousing on SQL Server. This is a presentation you really don’t want to miss!

This seminar is free. More information on the link below. Registration is mandatory (since number of seats is limited).

http://sqlug20120221.eventbrite.com/

Leave a Comment

Filed under SQLug.se

SQL Server 2012 RC0 now available for download

SQL Server 2012 RC0 (Release Candidate 0), formerly known as SQL Server Denali, is now available for download.

SQL Server 2012 RC0 is the first release which is feature complete and represents an exciting milestone for customers and partners looking to experience the full value of SQL Server 2012.

Download your evaluation version and try the new features here!
http://www.microsoft.com/download/en/details.aspx?id=28145

RC0 is a production quality release that includes access to upgrade and migration tools like Upgrade Advisor, Distributed Replay and SQL Server Migration Assistant (SSMA). Upgrade Advisor and Distributed Replay allow you to perform thorough analysis and testing of your current SQL Server applications before upgrading so you know what to expect. You can also use SSMA to automate migrate non-SQL Server databases to SQL Server 2012.

2 Comments

Filed under Technical

Some resources on Power View

Here are some resources I can recommend for learning more about Power View (formerly known as codename “Crescent”).

Good luck!

Leave a Comment

Filed under Technical

PASS SQLRally Nordic statistics

Only two days left until PASS SQLRally Nordic 2011! When I first talked to PASS and Microsoft over a year ago, about running this conference in Sweden, I never believed this conference would grow this big. But as we got great support from PASS and Microsoft, the project quickly grew. Having this strong support from the people that runs the world’s largest SQL Server conference has helped us very much. The timing has also been very good with PASS’s plans for international growth and Microsoft about to launch SQL Server 2012.

We have nearly 500 people coming (including staff, technicians, etc). The event got fully booked 4 weeks before the launch, which I see as a really good sign of the interest! We could have sold more attendee tickets if there was space.

It’s great to see that we have 13 countries represented among the visitors. The largest group is, not very surprisingly, the Swedish. On second place comes Denmark.

Interesting about the marketing is that although we have send out over 100 000 email invitations, advertised in magazines and on web sites, the most efficient marketing has been word of mouth. Almost 50% of attendees say they heard about SQLRally from a friend/colleague. The second most common way to hear about us is through a user group.

The company with the largest numbers of attendees is, not very surprisingly, Microsoft. On second place comes SolidQ. What strikes me is that large companies are very underrepresented. Maybe smaller companies invest more in competence and their employees.

My gut feeling was that Business Intelligence is the most popular topic. This was confirmed by 53% of attendees choosing Business Intelligence as their primary track. Database development comes on second place with 34% and database administration on the third.

As usual, these conferences are very male dominated. Only about 13% of attendees are women. I wonder if this is representative for the percentage of women working with SQL Server and Business Intelligence.

Equally important as the technical contents of a conference is the social part. We’ve really invested heavily here and hired E-type and a group of vikings to give a Nordic touch to SQLRally.

Looking forward to see you at SQLRally. I believe this event will be something to really remember for a long time!

Leave a Comment

Filed under Technical

New SQL Server: “Denali” is 2012 and “Crescent” is Power View

Microsoft announced at PASS Summit in Seattle this week that the next version of SQL Server will be named SQL Server 2012. It will be released first half of year 2012. A feature complete release candidate (RC0), where everyone can try out the new features, will be released before the end of 2011.

Microsoft also announced that the new BI self-service reporting tool, codename “Crescent”, will be given the official name Power View. Power View is a highly interactive data exploration tool and a preview is already available with the CTP3 of Denali.

The PASS Summit keynotes are available on the PASS website here.

The CTP3 of SQL Server 2012 (“Denali”) can be downloaded here:
https://www.microsoft.com/betaexperience/pd/SQLDCTP3CTA/enus/default.aspx

Leave a Comment

Filed under Technical

New family member

My greatest moment in life since Alice arrived… Say hello to her little baby sister!

 

2 Comments

Filed under General

InstantCube – autogenerating SSAS cubes

When I started programming around 1983, even writing small programs was a time consuming process. I used paper and pen together with a list of CPU instruction codes to manually compile my programs and then had to load them into memory using POKE statements in the built-in BASIC interpreter. If the program hung, which it often did, I had to restart the computer.

Today developers are getting more productive with languages and tools that enable us to work at a higher level. Some of us never liked doing manual repetitive tasks, so we wrote programs that did the job for us. This is exactly what got me started into autogenerating SSAS cubes (and ETL-packages). My first tool that I wrote in 2006 read its metadata from Excel spreadsheets and created relational tables, SSIS packages and SSAS cubes. It helped me create and manage a lot of cubes and ETL packages that were similar and to respond quickly to changing user needs. Since then I have developed a few tools for autogenerating SSAS cubes, measures, dimensions, KPIs, etc.

Why is this so time-saving? Have a look at the procedures below comparing manual v s using the InstantCube tool.

Option 1: Typical procedure for adding a dimension manually

Option 2: Procedure for adding a dimension using InstantCube

Some sample screenshots form InstantCube is shown here.

Application screenshot

Resulting cube

Resulting relational database

To learn more about autogenerating Analysis Services cubes I can recommend the following sources:

  • You can get an overview about how InstantCube works by reading chapter 53 (Autogenerating Analysis Services Cubes) in MVP Deep Dives 2.
  • I have published InstantCube as an open source project on CodePlex. You’ll learn much from the source code.
  • The AMO (Analysis Services Management Objects) documentation.
  • Looking at the XML Scripts you can generate from SSAS objects in Management Studio.

 

Leave a Comment

Filed under Technical

I have joined MEET – Microsoft Extended Expert Team

MEET consists of independent technical experts on Microsoft technologies that share their knowledge on conferences, seminars, blogs and on webcasts. If you are looking for the anwer to complicated technical problems or are looking to increase your knowledge you can contact these people.

MEET, Microsoft Extended Expert Team, is organized by Microsoft Sweden. Keep your eyes open for our activities on my blog, the MSDN and TechNet newsletters. Through our sessions and workshops you will learn to make better decisions and think in new ways.

Visit the MEET website here (in Swedish).

1 Comment

Filed under General, Technical

SQL Server MVP Deep Dives II Book launched

I’m happy to announce that the second SQL Server MVP Deep Dives book has now been launched. It will be available at PASS Global Summit where you also can meet many of the authors for book signing.

There are very good reasons you should buy this book:

  • It is one of the most valuable books on the SQL Server platform written by some of the greatest SQL Server experts in the world.
  • It’s an opportunity to give back. The royalties from the book goes to support charity (Operation Smile).

It contains 60 chapters written by several of the Microsoft SQL Server MVPs. Most of these industry leaders write their own books for other publishers and have collaborated to produce this unique book with all profits donated to charity. This effort was led by Kalen Delaney, Paul Randal, Kimberly Tripp, Greg Low, Brad McGehee and Paul Nielsen.

I was privileged to write a chapter about Autogenerating Analysis Services Cubes (Chapter 53).

This book is available from book stores around the world. It can also be ordered directly from Manning through this link:
http://www.manning.com/delaney/

1 Comment

Filed under Technical