ansaurus

Question

Answer 1

+1 A:

This is a link to an MSDN page that uses LINQ to DataSet to obtain the rows that appear in both DataTables. The example uses Intersect; I think you could modify it to use Except instead. I don't know whether the performance will be better or not.
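For what it's worth, here is a rough sketch of what the Except variant might look like. It assumes .NET 3.5 with a reference to System.Data.DataSetExtensions and two tables with the same schema; the class and method names (DataTableExcept, RowsOnlyInFirst) are made up for illustration and are not taken from the MSDN sample.

```csharp
using System.Data;
using System.Linq;

static class DataTableExcept
{
    // Hypothetical helper: returns the rows of 'first' that have no
    // value-equal row in 'second'. Assumes both tables share a schema.
    public static DataTable RowsOnlyInFirst(DataTable first, DataTable second)
    {
        // Clone copies the schema only, not the rows.
        DataTable result = first.Clone();

        // DataRowComparer.Default compares rows column by column by value.
        var onlyInFirst = first.AsEnumerable()
            .Except(second.AsEnumerable(), DataRowComparer.Default);

        foreach (DataRow row in onlyInFirst)
        {
            result.ImportRow(row);
        }

        return result;
    }
}
```

Note that Except has set semantics, so rows duplicated within the first table come back only once. The result is filled with ImportRow rather than CopyToDataTable because CopyToDataTable throws when the sequence is empty.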

Jonathan 2009-06-26 07:10:17

I am on .NET 2.0.

2009-06-26 10:54:51

Answer 2

A:

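// Requires: using System.Collections.Generic; using System.Data;
// dtRequired must have a PrimaryKey defined, otherwise Rows.Contains
// throws MissingPrimaryKeyException.
private IEnumerable<object[]> GetEnumerator( DataTable dtRequired, DataTable dtResponse )
{
    foreach( DataRow row in dtResponse.Rows )
    {
        // use the columns of the primary key below
        if( dtRequired.Rows.Contains( new object[] { row[0], row[2], row[4] } ) )
            continue;

        // row is in dtResponse but not in dtRequired
        yield return row.ItemArray;
    }
}

private void GetComplement( DataTable dtRequired, DataTable dtResponse, out DataTable dtResult )
{
    // assign the out parameter; Clone copies the schema only, not the rows
    dtResult = dtRequired.Clone();

    foreach( object[] items in GetEnumerator( dtRequired, dtResponse ) )
    {
        dtResult.Rows.Add( items );
    }
}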

maxwellb 2009-06-26 09:29:42

There may be some syntactic cleanup to do around IEnumerator<T> vs. IEnumerable<T>: the foreach loop needs the iterator method to return an IEnumerable.

maxwellb 2009-06-26 09:34:30

Answer 3

A:

You said that your looped Find() method is less efficient than Approach 1 (http://weblogs.sqlteam.com/davidm/archive/2004/01/19/739.aspx).
I've seen people talk about ADO.NET 3.5 and LINQ, which presumes you have a production-ready LINQ provider; otherwise you use an iterative method to populate some generic container.
I wonder whether creative use of a Hashtable would happen to be faster computationally (in the real world, not in theory). In the case of Diff(tbl1, tbl2), simply populate the hash with tbl2, then iteratively add the tbl1 members. For each successful add, also append a copy of the member to the output (difference) array to be displayed or returned. A failed add means the member already exists in tbl2, so don't output or return that value.
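Roughly what I have in mind, as an untested sketch that should compile on .NET 2.0. The HashDiff class and the keyColumn parameter are hypothetical names; I'm assuming both tables expose one column whose values identify a member and that the column has the same type in both. Since Hashtable.Add throws on a duplicate key, "failure" is checked with ContainsKey instead of catching the exception.

```csharp
using System.Collections;
using System.Data;

static class HashDiff
{
    // Returns the rows of tbl1 whose key value does not occur in tbl2.
    // keyColumn (hypothetical) names a column present in both tables.
    public static DataTable Diff(DataTable tbl1, DataTable tbl2, string keyColumn)
    {
        // Populate the hash with the keys of tbl2.
        Hashtable seen = new Hashtable();
        foreach (DataRow row in tbl2.Rows)
        {
            // The indexer overwrites, so duplicate keys in tbl2 are harmless.
            seen[row[keyColumn]] = true;
        }

        DataTable result = tbl1.Clone();
        foreach (DataRow row in tbl1.Rows)
        {
            object key = row[keyColumn];
            if (seen.ContainsKey(key))
            {
                continue; // already present, so not part of the difference
            }

            // Record the key so a duplicate in tbl1 is output only once.
            seen.Add(key, true);
            result.ImportRow(row);
        }

        return result;
    }
}
```

This does one hash lookup per row instead of one Find() per row, so the work grows with the combined size of the two tables. Whether it actually beats the other approaches at 15,000 objects would still need measuring.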

Let me know; I'll rework my code if you confirm that approach 3 is the fastest. I'm comparing a DirectoryServices.FindAll() collection to a SqlDataReader, and LINQ to Active Directory is still in third-party beta, I guess. So I need a 'production'-approved method here that is as efficient as possible at 15,000 objects.

2009-09-28 03:05:01


difference between 2 datatables
