ansaurus

Question

C# approach to mapping input files dynamically

Answer 1

+1 A:

Your parsers are going to have to know about your columns....otherwise it is unable to map the data to the specific object properties. Unless of course you introduce an indexed-properties class which you could store the information based on the order it is read.

You should create a parser factory and based on the extension of the file you would return the correct parser for the job e.g.

public class Record 
{ 
    private Dictionary<int, string> items = new Dictionary<int, string>(); 
    private int propCount; 

    public Record(int size) 
    { 
        // populate array with empty strings 
        for(int i = 0; i <= size -1; i++) 
            items.Add(i, String.Empty); 
        propCount = size; 
    } 

    public string this[int index] 
    { 
        get { return items[index]; } 
        set { items[index] = value; } 
    } 

    public int PropertyCount { get { return propCount; } } 
}

public interface IRecordParser 
{ 
    string FileName { get; set; } 
    string[] GetHeadings(); 
    bool HasHeaders { get; set; } 
    void GoToStart(); 
    Record ParseNextRecord(); 
} 

public abstract class RecordParser 
{ 
    public string FileName { get; set; } 
    public bool HasHeaders { get; set; } 
    public abstract string[] GetHeadings(); 
    public abstract void GoToStart(); 
    public abstract Record ParseNextRecord();     
} 

public class ExcelRecordParser : RecordParser, IRecordParser 
{ 
    public ExcelRecordParser() 
    { 
    } 

    public override string[] GetHeadings() 
    { 
        if (HasHeaders)  
            // return column headings
        else 
            // return default headings from settings file
    } 

    public override void GoToStart() 
    { 
        // navigate to first row (or +1 if HasHeaders is true) 
    } 

    public override Record ParseNextRecord() 
    { 
        var headers = GetHeadings(); 
        var r = new Record(headers.Length);

        // enumerate rows, then for each row do...
        for(int i = 0; i <= headers.Length - 1; i++) 
            r[i] = row[i];

        return r;
    } 
}

public class CsvRecordParser : RecordParser, IRecordParser 
{ 
    public CsvRecordParser() 
    { 
    } 

    public override string[] GetHeadings() 
    { 
        if (HasHeaders) 
            // return first row split as headings
        else 
            // return default headers from settings file
    } 

    public override void GoToStart() 
    { 
        // navigate to start of file (or +1 if HasHeaders is true) 
    } 

    public override Record ParseNextRecord() 
    { 
        var headers = GetHeadings(); 
        var r = new Record(headers.Length);

        // enumerate lines, then for each line do...
        for(int i = 0; i <= headers.Length - 1; i++) 
            r[i] = line[i];

        return r;
    } 
} 

public static class RecordParserFactory 
{ 
    public static IRecordParser Create(string ext) 
    { 
        switch (ext) 
        { 
            case ".xls": 
                return new ExcelRecordParser() as IRecordParser; 
            case ".csv": 
                return new CsvRecordParser() as IRecordParser;
            default:
                return null;
        } 
    } 
}

Usage

// would return an instance of CSV Parser
string file = @"C:\Data\MyRecords.csv";
IRecordParser parser = RecordParserFactory.Create(System.IO.Path.GetExtension(file));

// would return an instance of Excel Parser
string file = @"C:\Data\MyRecords.xls";
IRecordParser parser = RecordParserFactory.Create(System.IO.Path.GetExtension(file));

This would allow you to add other parsers if your file format changes in the future e.g. XML, Binary etc

James 2010-02-08 12:00:01

Hi James, yep this is the part I am happy with and have implemented something very similar. I think I have been trying to over complicate it, indirectly. I want to allow the business to ensure they can dynamically load new data by setting the columns and mappings them selves. But the business logic can / should not change, so I think you have put me down the right path to a good solution. Thanks :)

Jon 2010-02-08 12:20:49

@Jon, happy to help. It is always hard trying to parse dynamic information however, I would say your system does need to know at least *something* about the data it is parsing otherwise it would be impossible for it to map it. Unless of course it was purely down to the order in which the data is stored, in that case you could probably use reflection or even update your class to support indexed properties. I will update my answer to show you what I mean.

James 2010-02-08 12:26:55

@Jon, see my updated answer it now includes an indexed-properties based class. Basically that allows you to have as many properties as you want and it will always populate based on the order of the information. You could easily update the Record class to take in a list of headings instead of a size and then get the information based on the headings. However, to be as generic as possible, stick with indexes.

James 2010-02-08 12:59:41

thanks James. Very helpful indeed. Most grateful. :) That really clarifies / solidifies my thinking and nailed the approach.

Jon 2010-02-08 13:03:16

ansaurus

tags:

views:

answers:

C# approach to mapping input files dynamically

related questions