views:

486

answers:

4

I was wondering if anyone knew how to quickly detect whether there is data in a given worksheet or not, without actually looping through all of the rows/columns of the worksheet to figure this out.

I am writing an importer that will import data right into the active worksheet (if it has not been modified), or create a new worksheet and import into it instead. I am currently looping through the entire sheet, and there is some noticeable lag-time in my import.

I would appreciate any help in the matter. Thank you!

A: 

This should be pretty fast:

    private void CheckForContent()
    {
        Worksheet activeSheet = ActiveSheet;
        var range = activeSheet.get_Range("A1", GetExcelColumnName(activeSheet.Columns.Count)+activeSheet.Rows.Count.ToString() );
        range.Select();
        range.Copy();
        string text = Clipboard.GetText().Trim();
        if(string.IsNullOrEmpty(text))
        {
            MessageBox.Show("No text");
        }
    }

    private string GetExcelColumnName(int columnNumber)
    {
        int dividend = columnNumber;
        string columnName = String.Empty;
        int modulo;

        while (dividend > 0)
        {
            modulo = (dividend - 1) % 26;
            columnName = Convert.ToChar(65 + modulo).ToString() + columnName;
            dividend = (int)((dividend - modulo) / 26);
        }
        return columnName;
    }
Mikael Svenson
+1  A: 

To avoid looping and take advantage of nearly instantaneous execution speed, you can use the Excel.WorksheetFunction.CountA method, which returns the same result as the =CountA() worksheet function.

Assuming that your Excel.Application reference is named 'excelApp' and your Excel.Worksheet reference is named 'worksheet', you can use code like the following in C# 4.0:

// C# 4.0
int dataCount = (int)excelApp.WorksheetFunction.CountA(worksheet.Cells);

if (dataCount == 0)
{
    // All cells on the worksheet are empty.
}
else
{
    // There is at least one cell on the worksheet that has non-empty contents.
}

In C# 3.0 and below, it's a bit more verbose, because you have to explicitly provide the missing optional arguments:

// C# 3.0 and below
int dataCount = (int)excelApp.WorksheetFunction.CountA(
    worksheet.Cells, 
    Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, 
    Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, 
    Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, 
    Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing, 
    Type.Missing, Type.Missing, Type.Missing, Type.Missing, Type.Missing);

if (dataCount == 0)
{
    // All cells on the worksheet are empty.
}
else
{
    // There is at least one cell on the worksheet that has non-empty contents.
}

I think this should do it for you!

Mike

Mike Rosenblum
A: 

I found the following solution, which is also instantaneous but I'm not sure how accurate it is... it has passed all my tests thus far.

Here it is for anyone who wants to know:

Worksheet sheet = (Worksheet)this.Application.ActiveSheet;
Range usedRange = sheet.UsedRange;
bool isUsed = (usedRange.Count > 1);
if (usedRange.Count == 1)
{
  isUsed = (usedRange.Value2 != null) &&
           (!string.IsNullOrEmpty(usedRange.Value2.ToString()));
}

if(isUsed)
{
  // worksheet cells not empty
}

I suppose this is a lot simpler than blowing up the clipboard every time I do the check or counting all the non-empty cells in the worksheet. Thanks Mikael and Mike, I appreciate both your answers.

Mohamed Nuur
momad, your approach is not bad, but it is (a) more work, and (b) will incorrectly report some worksheets as containing data when it does not, because the UsedRange can be affected by such things as formatting in the cells, not just values. If you make use of the `Application.WorksheetFunction.CountA` method, instead, you would pickup all data and *only* data in one line, instead of five.
Mike Rosenblum
Both are much better than using the clipboard for sure. The clipboard way will also take into account cells with "white space", which the other ways won't, but that's a special case :)
Mikael Svenson
Mike, I didn't know about the formatting. If that's the case, then I definately agree that the worksheet function CountA is a better approach. However, in my specific scenario, I don't want to want to override a cell even if the change is only a formatting change. Thanks for the clarification!
Mohamed Nuur
+1  A: 

Hi momad,

I work with VSTO and Excel for some time, at a very intense level, so I hope I can share things I've learned over the course of time with you.

Based on the information you've provided, I'd recommend casting to an object array and working with that information instead. Basically you'd access the values as:

object[,] arrayValues = (object[,])ExcelRange.Value2;

arrayValues is a 2D array ([row,column]). Excel populates the array blazingly fast, and of course operations on the array will be very performant (don't worry about the performance of the boxing, it's NOT an issue, believe me).

HTH, James

code4life
Thanks James,I am working on a VSTO add-in for excel and often run into issues that can be quickly answered by someone in the know, but time consuming to find on the internet. I'll be sure to make more use of stackoverflow since there's so many VSTO experts out here!
Mohamed Nuur