ansaurus

Question

Design pattern for class with upwards of 100 properties

Answer 1

+30 A:

You could try to 'normalize' it like you would a database table. Maybe put all the address related properties in an Address class for example - then have a BillingAddress and MailingAddress property of type Address in your Invoice class. These classes could be reused later on also.

Philip Wallace 2009-10-27 19:45:52

@Philip - sadly the data doesn't really subclass into tidy segments. Although there is the possibility of breaking some parts down... hmmm... will take another look. Thanks.

Metro Smurf 2009-10-27 19:58:00

I agree. Try to break up the data into smaller parts. 100 properties on a class is out of control.

Chuck Conway 2009-10-27 21:32:24

Also, what about having a generic List<Property> to hold all the properties?

RCIX 2009-11-02 11:49:40

Answer 2

+2 A:

Unless your code actually uses many of the attributes at many places, I'd go for a dictionary instead.

Having real properties has its advantages(type-safety, discoverability/intellisense, refactorability) but these don't matter if all the code does is gets these from elsewhere, displays on UI, sends in a web-service, saves to a file etc.

zvolkov 2009-10-27 19:45:53

@zvolkov - around 90% of the attributes are editable by the processors and is used throughout the UI.

Metro Smurf 2009-10-27 19:59:47

@Metro, that's irrelevant. Are they referenced by the code directly (e.g. Invoice.Subtotal)? If not, you can always marshal between dictionary and textboxes for editing purposes.

zvolkov 2009-10-27 20:03:28

@zvolkov - yes, they are directly referenced by the code. I'm not convinced a dictionary is the direction we want to go here; specifically, we would like to have everything strongly typed.

Metro Smurf 2009-10-27 20:09:44

But used in what way? This seems very much like a problem for a data driven approach, where rather than have a class you have a definition file that defines all the attributes you support, and metadata about them (like display name). The UI can pull from this dictionary an attribute to edit, from which it can get a display name, and a means to get the current value and to set new updated/new values.

Kendall Helmstetter Gelner 2009-10-27 20:10:28

On strong typing - do you mean to the attribute, or type as in "this is a number"? That can be modeled by having an editable attribute map back to a NumberType, PhoneNumberType, StringType, StringLimitedType, etc. You pull from the dictionary they type of object you are expecting, and the typing comes from working against the expected class type you are getting for any given attribute.

Kendall Helmstetter Gelner 2009-10-27 20:12:44

I think also, in general you are way too hung up on the static typing. If you want the ultimate in typing it involves, well, a lot of typing - as in 100 attributes defined in a class. For things like this a more flexible middle ground where you get some type checking at points but not 100%, is going to be so much easier to work with that the gains over any type mismatch issues will be huge.

Kendall Helmstetter Gelner 2009-10-27 20:14:50

@Kendall, I should have mentioned that we are using WPF and the Composite Application Library (Prism); several of the attributes will need to be dependency properties. Would using a definition file still be an option?

Metro Smurf 2009-10-27 21:02:17

If it calls for a data driven approach I think loadmasters suggestion (ie a data transfer class with all public fields) is better than a dictionary. It is both simpler and doesn't lose type safety, discoverability etc. Apart from introducing runtime flexibility (and bugs), a dictionary is pointless in this scenario.

Ash 2009-10-28 01:17:53

Answer 3

+1 A:

I used a Dictionary < string,string > for something like this.

it comes with a whole bunch of functions that can process it, it's easy to convert strings to other structures, easy to store, etc.

Oren Mazor 2009-10-27 19:46:40

... not statically typed, not type-safe in general, non-refactoring-friendly, not idiomatic C#, etc.

Pavel Minaev 2009-10-27 19:55:49

but it'll definitely get you laid.

Oren Mazor 2009-10-27 19:59:12

Great you can now apply the Calculate_Sales_Tax method to the Zip code.

James Anderson 2009-10-28 01:34:09

Answer 4

+2 A:

It would be too many columns when your class / table that you store it in starts to violate the rules of normalization.

In my experience, it has been very hard to get that many columns when you are normalizing properly. Apply the rules of normalization to the wide table / class and I think you will end up with fewer columns per entity.

Raj More 2009-10-27 19:48:12

Answer 5

+8 A:

I would imagine that some of these properties are probably related to each other. I would imagine that there are probably groups of properties that define independent facets of an Invoice that make sense as a group.

You may want to consider creating individual interfaces that model the different facets of an invoice. This may help define the methods and properties that operate on these facets in a more coherent, and easy to understand manner.

You can also choose to combine properties that having a particular meaning (addresses, locations, ranges, etc) into objects that you aggregate, rather than as individual properties of a single large class.

Keep in mind, that the abstraction you choose to model a problem and the abstraction you need in order to communicate with some other system (or business process) don't have to be the same. In fact, it's often productive to apply the bridge pattern to allow the separate abstractions to evolve independently.

LBushkin 2009-10-27 19:49:44

@LBushkin - thank you, re: Bridge Pattern. This is the type of answers I was looking for and will spend some time reading up on it.

Metro Smurf 2009-10-27 20:11:41

Answer 6

+6 A:

Hmmm... Are all of those really relevant specifically, and only to the invoice? Typically what I've seen is something like:

class Customer:
.ID
.Name

class Address
.ID 
.Street1
.Street2
.City
.State
.Zip

class CustomerAddress
.CustomerID
.AddressID
.AddressDescription ("ship","bill",etc)

class Order
.ID
.CustomerID
.DatePlaced
.DateShipped
.SubTotal

class OrderDetails
.OrderID
.ItemID
.ItemName
.ItemDescription
.Quantity
.UnitPrice

And tying it all together:

class Invoice
.OrderID
.CustomerID
.DateInvoiced

When printing the invoice, join all of these records together.

If you really must have a single class with 100+ properties, it may be better to use a dictionary

Dictionary<string,object> d = new Dictionary<string,object>();
d.Add("CustomerName","Bob");
d.Add("ShipAddress","1600 Pennsylvania Ave, Suite 0, Washington, DC 00001");
d.Add("ShipDate",DateTime.Now);
....

The idea here is to divide your into logical units. In the above example, each class corresponds to a table in a database. You could load each of these into a dedicated class in your data access layer, or select with a join from the tables where they are stored when generating your report (invoice).

David Lively 2009-10-27 19:50:27

What benefits will `Dictionary` give here, exactly, except for more clumsy syntax and lack of type safety?

Pavel Minaev 2009-10-27 19:56:25

Well, a typed dictionary wouldn't cause lack of type safety. Also, as I've said, normalizing the data is a much better way to go. Using a dictionary, though, is far more expedient than declaring a class with 100+ properties. I've NEVER seen an instance where that was necessary, or desirable.

David Lively 2009-10-27 19:59:17

@ztech - I can't see a dictionary being a viable solution if you have properties of differening types. Then you would have to store them as type Object. As Pavel said, this will quickly lead to type safety, for example when I put "foo" into InvoiceDate.

Philip Wallace 2009-10-27 20:15:23

Like I said, I don't think a dictionary is a good idea. But it's comparably bad to creating a class with 100+ properties. Choose your insanity.

David Lively 2009-10-27 20:46:18

Why is d.Get("Address_Line_1") better than inv.AddressLine[0]?Also in any reasoable business app you would probably have a good deal of validation rules for each field and the class as a whole.You end up with a humungous mess if you implement with a dictionary.

James Anderson 2009-10-28 10:47:42

"Are those really relevant " comment. Well probably yes in most businesses an "invoice" can also perform the functions of a pick list, a bill of lading, a shipping note, a receipt and sometimes a payment slip. So you need to support "In Transit", "Recieved By", "Damaged", "Rejected" , "Paid" "Overdue" etc. etc.

James Anderson 2009-10-28 10:52:35

Answer 7

+21 A:

The bad design is obviously in the system you are submitting to - no invoice has 100+ properties that cannot be grouped into a substructure. For example an invoice will have a customer and a customer will have an id and an address. The address in turn will have a street, a postal code, and what else. But all this properties should not belong directly to the invoice - an invoice has no customer id or postal code.

If you have to build an invoice class with all these properties directly attached to the invoice, I suggest to make a clean design with multiple classes for a customer, an address, and all the other required stuff and then just wrap this well designed object graph with a fat invoice class having no storage and logic itself just passing all operations to the object graph behind.

Daniel Brückner 2009-10-27 19:54:08

@Daniel - yes! you've nailed it, re: the internal system we submit to is the bottleneck.

Metro Smurf 2009-10-27 20:02:04

@Metro: that doesn't mean your object model has to suffer, just the part that submits it to the internal system *needs* to be ugly.

Austin Salonen 2009-10-27 22:01:27

Sounds like something similar to an adapter - you have your own object model, nicely broken into small classes, and one ugly class where you shove them so that it looks to the internal system as one giant class with 100 properties...

Mathias 2009-10-28 01:13:59

Answer 8

A:

Do you always need all the properties that are returned? Can you use projection with whatever class is consuming the data and only generate the properties you need at the time.

Gus Paul 2009-10-27 20:11:00

sounds a little cryptic? example plz?

zvolkov 2009-10-27 20:35:48

Answer 9

A:

You might find some useful ideas in this article about properties by Steve Yegge called the Universal Design Pattern.

kb 2009-10-27 20:57:23

Answer 10

A:

You could try LINQ, it will auto-gen properties for you. If all the fields are spread across multiple tables and you could build a view and drag the view over to your designer.

tom d 2009-10-27 21:24:04

Answer 11

A:

Dictionary ? why not, but not necessarily. I see a C# tag, your language has reflection, good for you. I had a few too large classes like this in my Python code, and reflection helps a lot :

for attName in 'attr1', 'attr2', ..... (10 other attributes):
    setattr( self, attName, process_attribute( getattr( self, attName ))

When you want to convert 10 string members from some encoding to UNICODE, some other string members shouldn't be touched, you want to apply some numerical processing to other members... convert types... a for loop beats copy-pasting lots of code anytime for cleanliness.

peufeu 2009-10-27 22:13:52

Answer 12

+2 A:

It's considered bad O-O style, but if all you're doing is populating an object with properties to pass them onward for processing, and the processing only reads the properties (presumably to create some other object or database updates), them perhaps a simple POD object is what you need, having all public members, a default constructor, and no other member methods. You can thus treat is as a container of properties instead of a full-blown object.

Loadmaster 2009-10-27 22:45:09

I agree, it may not be good O-O, but it's a better solution than a dictionary for this situation.

Ash 2009-10-28 01:19:01

@Loadmaster - some of the properties are initially populated. They are then updated as needed and stored until ready to be sent off to the consuming system. I'm going to update the question with a bit more background (may be later tonight or tomorrow).

Metro Smurf 2009-10-28 01:37:48

Answer 13

+1 A:

You should not be motivated purely by aesthetic considerations.

Per your comments, the object is basically a data transfer object consumed by a legacy system that expects the presence of all the fields.

Unless there is real value in composing this object from parts, what precisely is gained by obscuring its function and purpose?

These would be valid reasons:

1 - You are gathering the information for this object from various systems and the parts are relatively independent. It would make sense to compose the final object in that case based on process considerations.

2 - You have other systems that can consume various sub-sets of the fields of this object. Here reuse is the motivating factor.

3 - There is a very real possibility of a next generation invoicing system based on a more rational design. Here extensibility and evolution of the system are the motivating factor.

If none of these considerations are applicable in your case, then what's the point?

2009-10-28 00:51:29

@alphazero - I'm not sure I follow. But, to answer: 1=true, 2=possibly, but not now, 3 is unlikely in the coming years. Can you expand on your answer?

Metro Smurf 2009-10-28 01:29:03

In that case it makes sense to compose the object of parts. You can partition based on scenarios 1 or 2, depending on which is the stronger motivation.

2009-10-28 01:43:15

Answer 14

+1 A:

It sounds like for the end result you need to produce an invoice object with around 100 properties. Do you have 100 such properties in every case? Maybe you would want a factory, a class that would produce an invoice given a smaller set of parameters. A different factory method could be added for each scenario where the relevant fields of the invoice are relevant.

Frank Schwieterman 2009-10-28 01:03:53

@Frank - In our case we will be consuming most of the properties. The other properties are derived from what we consume.

Metro Smurf 2009-10-28 01:33:18

Answer 15

A:

If an entity has a hundred unique attributes than a single class with a hundred properties is the correct thing to do.

It may be possible to split things like addresses into a sub class, but this is because an address is really an entity in itself and easily recognised as such.

A textbook (i.e. oversimplified not usable in the real world) invoice would look like:-

 class invoice:
    int id;
    address shipto_address;
    address billing_address;
    order_date date;
    ship_date date;
    .
    .
    . 
    line_item invoice_line[999];

  class line_item;
    int item_no.
    int product_id;
    amt unit_price;
    int qty;
    amt item_cost;
    .
    .
    .

So I am surpised you dont have at least an array of line_items in there.

Get used to it! In the business world an entity can easily have hundreds and sometimes thousands of unique attributes.

James Anderson 2009-10-28 01:31:52

Not to nit-pick as I know the above is an off-the-cuff answer, but using a statically-sized array to hold line items is a bad idea. Better to use List<line_item> in C#.

David Lively 2009-10-28 15:16:27

Answer 16

A:

if all else fails, at least split the class to several partial classes to have better readability. it'll also make it easier for the team to work in parallel on different part of this class.

good luck :)

Ami 2009-10-28 14:23:41

Answer 17

+1 A:

If what you're trying to create is a table gateway for pre-existing 100-column table to this other service, a list or dictionary might be pretty quick way to get started. However if you're taking input from a large form or UI wizard, you're probably going to have to validate the contents before submission to your remote service.

A simple DTO might look like this:

class Form
{
    public $stuff = array();
    function add( $key, $value ) {}
}

A table gateway might be more like:

class Form
{
    function findBySubmitId( $id ) {} // look up my form
    function saveRecord() {}       // save it for my session
    function toBillingInvoice() {} // export it when done
}

And you could extend that pretty easily depending on if you have variations of the invoice. (Adding a validate() method for each subclass might be appropriate.)

class TPSReport extends Form { 
    function validate() {}
}

If you want to separate your DTO from the delivery mechanism, because the delivery mechanism is generic to all your invoices, that could be easy. However you might be in a situation where there is business logic around the success or failure of the invoice. And this is where I'm prolly going off into the weeds. But it's where and OO model can be useful...I'll wage a penny that there will be different invoices and different procedures for different invoices, and if invoice submission barfs, you'll need extra routines :-)

class Form { 
    function submitToBilling() {}
    function reportFailedSubmit() {}
    function reportSuccessfulSubmit() {}
}
class TPSReport extends Form { 
    function validate() {}
    function reportFailedSubmit() { /* oh this goes to AR */ }
}

Note David Livelys answer: it is a good insight. Often, fields on a form are each their own data structures and have their own validation rules. So you can model composite objects pretty quickly. This would associate each field type with its own validation rules and enforce stricter typing.

If you do have to get further into validation, often business rules are a whole different modelling from the forms or the DTOs that supply them. You could also be faced with logic that is oriented by department and has little to do with the form. Important to keep that out of the validation of the form itself and model submission process(es) separately.

If you are organizing a schema behind these forms, instead of a table with 100 columns, you would probably break down the entries by field identifiers and values, into just a few columns.

table FormSubmissions (
    id         int
    formVer    int -- fk of FormVersions
    formNum    int -- group by form submission
    fieldName  int -- fk of FormFields
    fieldValue text
)
table FormFields (
    id         int
    fieldName  char
)
table FormVersions (
    id
    name
)
select s.* f.fieldName from FormSubmissions s
left join FormFields f on s.fieldName = f.id
where formNum = 12345 ;

I would say this is definitely a case where you're going to want to re-factor your way around until you find something comfortable. Hopefully you have some control over things like schema and your object model. (BTW...is that table known a 'normalized'? I've seen variations on that schema, typically organized by data type...good?)

memnoch_proxy 2009-10-29 06:45:24

ansaurus

tags:

views:

answers:

Design pattern for class with upwards of 100 properties

related questions