ansaurus

Question

Best practices for using and persisting enums

Answer 1

+2 A:

Imho, as for the code part:

You should always use the 'enum' type for your enumerations. You get a lot of freebies if you do: type safety, encapsulation and switch avoidance, support for collections such as EnumSet and EnumMap, and code clarity.
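
To illustrate the collection support, here is a minimal Java sketch. The Weekday enum and the EnumCollectionsDemo class are made up for this example; only EnumSet and EnumMap come from the JDK.

```java
import java.util.EnumMap;
import java.util.EnumSet;
import java.util.Map;

// Hypothetical enum, used only for illustration.
enum Weekday { MONDAY, TUESDAY, WEDNESDAY, THURSDAY, FRIDAY, SATURDAY, SUNDAY }

class EnumCollectionsDemo {
    // EnumSet is a Set implementation specialised for the constants of one enum type.
    static final EnumSet<Weekday> WEEKEND = EnumSet.of(Weekday.SATURDAY, Weekday.SUNDAY);

    // complementOf returns every constant that is not in the given set.
    static final EnumSet<Weekday> WORKDAYS = EnumSet.complementOf(WEEKEND);

    // EnumMap is a Map keyed by enum constants; it needs the key class at construction.
    static Map<Weekday, Integer> openingHours() {
        Map<Weekday, Integer> hours = new EnumMap<>(Weekday.class);
        for (Weekday day : WORKDAYS) {
            hours.put(day, 9); // assumed opening hour, purely illustrative
        }
        return hours;
    }

    public static void main(String[] args) {
        System.out.println(WORKDAYS);
        System.out.println(openingHours());
    }
}
```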

As for the persistence part, you can always persist the string representation of the enum and load it back using the enum type's valueOf(String) method.
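
A minimal Java sketch of that round trip might look like this. OrderStatus and the two helper methods are hypothetical names chosen for the example; name() and valueOf(String) are the standard enum methods.

```java
// Hypothetical enum, used only for illustration.
enum OrderStatus { NEW, PAID, SHIPPED }

class EnumNamePersistence {
    // The value to write to a text column: the constant's declared name.
    static String toColumn(OrderStatus status) {
        return status.name();
    }

    // The value read back from the column. valueOf throws IllegalArgumentException
    // if the stored text does not match any constant (for example after a rename).
    static OrderStatus fromColumn(String stored) {
        return OrderStatus.valueOf(stored);
    }
}
```

Note that this ties the stored data to the constant names, so renaming a constant requires migrating the existing rows.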

MahdeTo 2009-04-14 11:27:05

Agree in principle, however at least in Java "enum" is limited in that it cannot have a superclass (as noted above), so sometimes a "typesafe enum" class is probably better.

sleske 2009-04-14 11:45:47

Answer 2

+2 A:

Storing the text value of an enum in a database is less preferable than storing an integer, because of the additional space required and slower searching. Text is valuable in that it carries more meaning than a number; however, the database is for storage, and the presentation layer is for making things look nice.

ck 2009-04-14 11:33:59

The enum's int value is not guaranteed to be the same over time.

Miguel Ping 2009-04-14 11:43:13

Also, if you use a short string, performance should be the same. A char(2) takes 2 bytes, an int usually also takes 2 or 4.

sleske 2009-04-14 11:44:42

@Miguel Ping: The idea is to *explicitly* assign an ID (int or char) to each enum. Using the internally generated int of the enum is indeed very dangerous.
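
One way to sketch the explicit-ID idea in Java is shown below. The Priority enum, its id values, and the fromId helper are assumptions made for illustration, not something from this thread.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical enum whose persisted ID is assigned explicitly and is
// independent of ordinal(), so reordering constants does not change stored data.
enum Priority {
    LOW(10), MEDIUM(20), HIGH(30);

    private static final Map<Integer, Priority> BY_ID = new HashMap<>();

    static {
        // Build the reverse lookup once and fail fast on duplicate IDs.
        for (Priority p : values()) {
            if (BY_ID.put(p.id, p) != null) {
                throw new IllegalStateException("Duplicate id: " + p.id);
            }
        }
    }

    private final int id;

    Priority(int id) {
        this.id = id;
    }

    // The value to store in the database column.
    int id() {
        return id;
    }

    // Convert a stored value back to the constant; unknown IDs are rejected.
    static Priority fromId(int id) {
        Priority p = BY_ID.get(id);
        if (p == null) {
            throw new IllegalArgumentException("Unknown Priority id: " + id);
        }
        return p;
    }
}
```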

sleske 2009-04-14 11:47:13

Answer 3

+2 A:

In Java or C#, you should always use enums in code. Disclaimer: My background is C#.

If the value is to be persisted to a database, the integral values of each enumeration member should be explicitly defined so that a later change in code does not accidentally alter translated enum values and thus application behavior.

Values should always be persisted to a database as integral values, to protect against enum name refactoring. Keep documentation on each enumeration in a wiki and add a comment to the database field pointing to the wiki page documenting the type. Also add XML documentation to the enum type containing a link to the wiki entry so that it is available through IntelliSense.

If you use a tool to generate CRUD code it should be capable of defining an enumeration type to use for a column so that generated code objects always use enumerated members.

If custom logic needs to be applied for an enumeration member, you have some options:

If you have an enum MyEnum, create a static class MyEnumInfo which offers utility methods to discover additional information about the enum member, by switch statements or whatever means necessary. Appending "Info" to the end of the enum name in the class name ensures that they will be next to each other in IntelliSense.
Decorate the enumeration members with attributes to specify additional parameters. For example we have developed an EnumDropDown control which creates an ASP.NET dropdown filled with enumeration values, and an EnumDisplayAttribute specifies the nicely formatted display text to use for each member.
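
The attribute approach above is described for C#. A rough Java analogue uses a runtime annotation on each enum constant and reads it back through reflection. In the sketch below, DisplayText, ShippingMethod and DisplayTexts are hypothetical names; this is not the EnumDisplayAttribute mentioned above, only an illustration of the same idea.

```java
import java.lang.annotation.ElementType;
import java.lang.annotation.Retention;
import java.lang.annotation.RetentionPolicy;
import java.lang.annotation.Target;

// Hypothetical annotation carrying the display text for an enum constant.
@Retention(RetentionPolicy.RUNTIME)
@Target(ElementType.FIELD)
@interface DisplayText {
    String value();
}

// Hypothetical enum, used only for illustration.
enum ShippingMethod {
    @DisplayText("Standard (3-5 days)") STANDARD,
    @DisplayText("Next-day express") EXPRESS,
    PICKUP; // no annotation: the lookup falls back to the constant name
}

class DisplayTexts {
    // Enum constants are public static fields of the enum class, so the
    // annotation can be read from the field that has the constant's name.
    static String of(Enum<?> member) {
        try {
            DisplayText text = member.getDeclaringClass()
                    .getField(member.name())
                    .getAnnotation(DisplayText.class);
            return text != null ? text.value() : member.name();
        } catch (NoSuchFieldException e) {
            // Should not happen: every enum constant has a matching field.
            throw new AssertionError(e);
        }
    }
}
```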

I have not tried this, but with SQL Server 2005 or later, you could theoretically register C# code with the database that would contain enum information and the ability to convert values to enums for use in views or other constructs, making a method of translating the data in a manner easier for DBAs to use.

David 2009-04-22 17:26:51

Answer 4

+4 A:

In the code handling for C# you've missed out explicitly declaring the 0 value. I almost always declare my first value as:

public enum SomeEnum
{
    // Explicit zero member: an unassigned SomeEnum field defaults to this value.
    None = 0,
}

This serves as a null value. Because the backing type is an integer, and an integer defaults to 0, it is massively useful in a lot of places to know whether an enum has actually been programmatically set or not.

Quibblesome 2009-04-23 15:11:12

I disagree. This would only make sense if you sometimes leave variables uninitialized, which I would consider seriously bad practice. I have often seen this idea of having a "none" value, but I believe it only hides the real problem (the uninitialized variable).

sleske 2009-04-28 08:07:53

How does it hide the problem? It makes it explicit, like a nullable int. I leave values uninitialised in code because I know what the CLR will set them to by default. They're still initialised; it's just implicit.

Quibblesome 2009-04-28 09:27:07

Well, it's probably a matter of style. I strongly believe in fully initializing all variables on declaration (or at most in an if-else directly after declaration). Otherwise you might forget to initialize them, especially if the code flow is complicated. See also http://c2.com/cgi/wiki?SingleStepConstructor.

sleske 2009-04-29 10:53:44

Answer 5

+2 A:

Well, from my experience, using enums for anything other than for passing options (as flags) to an immediate method call, results in switch-ing at some point.

If you are going to use the enum all over your code, then you might end up with code that is not so easy to maintain (the infamous switch statement)
Extending enums is a pain. You add a new enum item and end up in going through all of your code to check for all conditions.
With .NET 3.5, you can add extension methods to enums to make them behave a bit more like classes. However, adding real functionality this way is not so easy, since it's still not a class (you would end up using switch-es in your extension methods if not elsewhere).

So for an enum-like entity with a bit more of functionality you should take some time and create it as a class, with several things in mind:

To make your class behave like an enum, you can either force each derived class to instantiate as a Singleton, or override Equals to allow value comparison of different instances.
If your class is enum-like, it should mean that it should contain no serializable state - deserialization should be possible from its type alone (a sort of an "ID", as you said).
Persistence logic should be confined to the base class only, otherwise extending your "enum" would be a nightmare. In case that you went for the Singleton pattern, you would need to ensure proper deserialization into singleton instances.
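
As a rough Java sketch of such an enum-like class: each value is a singleton subclass instance, and the ID-based persistence logic lives only in the base class. PaymentType, its IDs and the fee rule are invented for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical enum-like class. The private constructor means the only
// instances are the constants declared here, so == comparison is safe.
abstract class PaymentType {
    // Declared first so it is initialised before the constants register themselves.
    private static final Map<String, PaymentType> BY_ID = new LinkedHashMap<>();

    static final PaymentType CARD = new PaymentType("CARD") {
        @Override
        long feeCents(long amountCents) {
            return amountCents * 2 / 100; // assumed 2% fee, purely illustrative
        }
    };

    static final PaymentType CASH = new PaymentType("CASH") {
        @Override
        long feeCents(long amountCents) {
            return 0;
        }
    };

    private final String id;

    private PaymentType(String id) {
        this.id = id;
        BY_ID.put(id, this);
    }

    // Persistence is confined to the base class: only the ID is stored...
    final String id() {
        return id;
    }

    // ...and loading maps the ID back to the existing singleton instance.
    static PaymentType fromId(String id) {
        PaymentType type = BY_ID.get(id);
        if (type == null) {
            throw new IllegalArgumentException("Unknown PaymentType id: " + id);
        }
        return type;
    }

    // Behaviour that varies per value, implemented by each subclass.
    abstract long feeCents(long amountCents);
}
```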

Groo 2009-04-27 08:07:40

Answer 6

+2 A:

Each time you find yourself using "magic numbers" in code, change them to enums. Besides saving time (since the magic will disappear when the bugs come...), it will save your eyes and memory (meaningful enums make code more readable and self-documenting), since, guess what, you are most probably the person who will maintain and develop your own code.

YordanGeorgiev 2009-04-27 19:31:19

Answer 7

+2 A:

The initial article looks fine to me. Still, based on the comments, it seems some remarks concerning Java enums might clarify a few things.

Enum type in Java is a class by definition, but many programmers tend to forget this, because they rather relate it to "a list of allowed values" as in some other languages. It's more than that.

So, to avoid those switch statements it might be reasonable to put some code and additional methods in the enum class. There's almost never a need to create a separate "enum-like real class".
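
For example, a Java enum can carry fields and give each constant its own method body, so callers never need to switch on the value. The Operation enum below is a made-up illustration of that language feature.

```java
// Hypothetical enum with per-constant behaviour instead of an external switch.
enum Operation {
    ADD("+") {
        @Override
        int apply(int a, int b) {
            return a + b;
        }
    },
    SUBTRACT("-") {
        @Override
        int apply(int a, int b) {
            return a - b;
        }
    },
    MULTIPLY("*") {
        @Override
        int apply(int a, int b) {
            return a * b;
        }
    };

    private final String symbol;

    Operation(String symbol) {
        this.symbol = symbol;
    }

    String symbol() {
        return symbol;
    }

    // Every constant must implement this, so adding a new constant without
    // its behaviour is a compile error rather than a forgotten switch case.
    abstract int apply(int a, int b);
}
```

A caller then writes op.apply(a, b) for any Operation op, and a newly added constant works everywhere without touching the call sites.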

Consider also the point of documentation - do you want to document the actual meaning of your enum in the database? In the source code reflecting the values (your enum type) or in some external documentation? I personally prefer the source code.

If you want to present enum values as integers in the database due to speed or whatever reason, that mapping should also reside in the Java enum. You'll get string-name mapping by default, and I've been content with that. There's an ordinal number associated with each enum value, but using that directly as a mapping between code and database is not very bright, because that ordinal number will change if someone reorders the values in the source code. Or adds additional enum values in between existing values. Or removes some value.

(Of course, if someone changes the name of the enum in the source code, the default string-mapping goes sour too, but that's less likely to happen accidentally. And you can more easily protect against that if necessary by putting some runtime-checking and check constraints in the database as suggested here already.)

lokori 2009-04-28 15:50:39

There are two scenarios to support: someone reordering the enums in my file OR someone doing some refactoring (to clarify poor initial name choices) and breaking persisted data. I think the latter is more important, and ordinal is the way to go for data persistence.

Justin 2010-05-12 21:18:35

Answer 8

+4 A:

I agree with much of what you say. One thing I'd like to append, though, about the persistence of enums: I don't believe the generation of the enums at build time from the DB values is acceptable, but I also think that the runtime check is not a good solution. I'd define a third means: have a unit test which will check the values of the enum against the database. This prevents "casual" divergence, and avoids the overhead of checking the enums against the database every time the code is run.
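
The core of such a test could be sketched in Java as follows. The TicketState enum is hypothetical, and the id-to-name map stands in for rows the test would load from a lookup table; the actual database query and test framework are left out on purpose.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical enum with explicit persisted IDs.
enum TicketState {
    OPEN(1), IN_PROGRESS(2), CLOSED(3);

    private final int id;

    TicketState(int id) {
        this.id = id;
    }

    int id() {
        return id;
    }
}

class EnumTableCheck {
    // dbRows maps id -> name as read from the (assumed) lookup table.
    // Returns a description of every divergence; an empty list means in sync.
    static List<String> findMismatches(Map<Integer, String> dbRows) {
        List<String> problems = new ArrayList<>();
        Map<Integer, String> remaining = new HashMap<>(dbRows);
        for (TicketState state : TicketState.values()) {
            String dbName = remaining.remove(state.id());
            if (dbName == null) {
                problems.add("missing in database: " + state + " (id " + state.id() + ")");
            } else if (!dbName.equals(state.name())) {
                problems.add("name differs for id " + state.id()
                        + ": code=" + state + ", database=" + dbName);
            }
        }
        // Anything left over exists in the database but not in the enum.
        for (Map.Entry<Integer, String> row : remaining.entrySet()) {
            problems.add("missing in code: " + row.getValue() + " (id " + row.getKey() + ")");
        }
        return problems;
    }
}
```

A unit test would load the rows once, call findMismatches, and fail with the returned messages if the list is not empty.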

McWafflestix 2009-04-28 20:12:42
