ansaurus

Question

Best way to store data locally in .NET (C#)

Answer 1

+7 A:

XML is easy to use, via serialization. Use Isolated storage.

See also How to decide where to store per-user state? Registry? AppData? Isolated Storage?

public class UserDB 
{
    // actual data to be preserved for each user
    public int A; 
    public string Z; 

    // metadata        
    public DateTime LastSaved;
    public int eon;

    private string dbpath; 

    public static UserDB Load(string path)
    {
        UserDB udb;
        try
        {
            System.Xml.Serialization.XmlSerializer s=new System.Xml.Serialization.XmlSerializer(typeof(UserDB));
            using(System.IO.StreamReader reader= System.IO.File.OpenText(path))
            {
                udb= (UserDB) s.Deserialize(reader);
            }
        }
        catch
        {
            udb= new UserDB();
        }
        udb.dbpath= path; 

        return udb;
    }


    public void Save()
    {
        LastSaved= System.DateTime.Now;
        eon++;
        var s= new System.Xml.Serialization.XmlSerializer(typeof(UserDB));
        var ns= new System.Xml.Serialization.XmlSerializerNamespaces();
        ns.Add( "", "");
        System.IO.StreamWriter writer= System.IO.File.CreateText(dbpath);
        s.Serialize(writer, this, ns);
        writer.Close();
    }
}

Cheeso 2009-12-21 19:02:54

This isn't very portable nor neat

0A0D 2009-12-21 19:20:50

Portable? It's an illustration.

Cheeso 2009-12-21 19:27:00

It looks like cut-paste code so you can be the first poster. Would have been better to just stick with your links.

0A0D 2009-12-21 19:38:37

Well, yeah, I cut it right out of the app that I wrote, that does this. Roboto, what's yer problem?

Cheeso 2009-12-21 22:57:10

Answer 2

+12 A:

It really depends on what you're storing. If you're talking about structured data, then either XML or a very lightweight SQL RDBMS like SQLite or SQL Server Compact Edition will work well for you. The SQL solution becomes especially compelling if the data moves beyond a trivial size.

If you're storing large pieces of relatively unstructured data (binary objects like images, for example) then obviously neither a database nor XML solution are appropriate, but given your question I'm guessing it's more of the former than the latter.

Adam Robinson 2009-12-21 19:03:25

A freely available ADO.NET provider for SQLite is http://sqlite.phxsoftware.com/

John K 2009-12-21 19:48:49

XML config files have to be structured?

0A0D 2009-12-21 19:59:53

@Roboto: XML, by definition, is structured. That doesn't mean you have to use them in a highly-structured manner, however.

Adam Robinson 2009-12-21 20:07:57

Answer 3

A:

My first inclination is an access database. The .mdb files are stored locally, and can be encrypted if that is deemed necessary. Though XML or JSON would also work for many scenarios. Flat files I would only use for read only, non-search (forward read only) information. I tend to prefer csv format to set width.

Matthew Vines 2009-12-21 19:03:33

Out of curiousity, why would you use Access unless it was already present or you needed access to it from, well, Access? Otherwise it seems like a lightweight SQL engine would be more advisable, especially with in-process options like SQLite and SQL Server CE available.

Adam Robinson 2009-12-21 19:09:08

The JET engine - which allows you to use .MDB files is, I think, installed with windows which is what makes use of .MDB files attactive as a solution, that and the fact that they're easy to dig into with access if you need to. However that predates the current incarnation of SQL Server CE which can be a .DLL "xcopy" deployment and is therefore a better way to achieve a broadly similar result.

Murph 2009-12-21 19:16:46

Friends don't let friends use Access

0A0D 2009-12-21 19:21:26

@Murph: Yes, JET is a Windows component now, but (as you point out) the XCOPY deployment (and in-process hosting) of SQL Server CE seem to eliminate any advantage that JET has, while retaining the disadvantages (limited and odd SQL syntax support, many ORM's don't support it, etc.). So...I guess my question still stands as to why you'd recommend it :)

Adam Robinson 2009-12-21 19:31:30

Ouch! I can pretty much guarantee you'd come to regret this decision! Said out of personal experience.

Groky 2009-12-21 19:44:13

Answer 4

+6 A:

A fourth option to those you mention are binary files. Although that sounds arcane and difficult, it's really easy with the serialization API in .NET.

Whether you choose binary or XML files, you can use the same serialization API, although you would use different serializers.

To binary serialize a class, it must be marked with the [Serializable] attribute or implement ISerializable.

You can do something similar with XML, although there the interface is called IXmlSerializable, and the attributes are [XmlRoot] and other attributes in the System.Xml.Serialization namespace.

If you want to use a relational database, SQL Server Compact Edition is free and very lightweight and based on a single file.

Mark Seemann 2009-12-21 19:04:10

Flat file != text file. I would think this would fall under the category of "flat file".

Adam Robinson 2009-12-21 19:10:07

You can binary serialize a class whether or not you are dealing with XML files

0A0D 2009-12-21 19:14:18

unless you need the serialized object to be human readable, this is the most reliable way to go. It serializes to a tiny file, and always seemed the fastest way to do it, in terms of how fast the code ran. And Mark is right, it seems arcane and difficult, but it is not at all. And binary serialization captures the ENTIRE object, even its private members, which XML serialization does not.

CubanX 2009-12-21 20:11:38

Answer 5

A:

It depends on the amount of data you are looking to store. In reality there's no difference between flat files and XML. XML would probably be preferable since it provides a structure to the document. In practice,

The last option, and a lot of applications use now is the Windows Registry. I don't personally recommend it (Registry Bloat, Corruption, other potential issues), but it is an option.

blacksol 2009-12-21 19:06:18

One area where flat files differ from XML is representing hierarchical data. You can represent hierarchies in flat files, sure, but it's easier to do so in XML.

itowlson 2009-12-21 19:09:10

Answer 6

+2 A:

I recommend XML reader/writer class for files because it is easily serialized.

Serialization in C#

Serialization (known as pickling in python) is an easy way to convert an object to a binary representation that can then be e.g. written to disk or sent over a wire.

It's useful e.g. for easy saving of settings to a file.

You can serialize your own classes if you mark them with [Serializable] attribute. This serializes all members of a class, except those marked as [NonSerialized].

The following is code to show you how to do this:

using System;
using System.Collections.Generic;
using System.Text;
using System.Drawing;


namespace ConfigTest
{ [ Serializable() ]

    public class ConfigManager
    {
        private string windowTitle = "Corp";
        private string printTitle = "Inventory";

    public string WindowTitle
    {
            get
            {
                return windowTitle;
            }
            set
            {
                windowTitle = value;
            }
        }

    public string PrintTitle
    {
            get
            {
                return printTitle;
            }
            set
            {
                printTitle = value;
            }
    }
   }
}

You then, in maybe a ConfigForm, call your ConfigManager class and Serialize it!

public ConfigForm()
{
    InitializeComponent();
    cm = new ConfigManager();
    ser = new XmlSerializer(typeof(ConfigManager));
    LoadConfig();
}

private void LoadConfig()
{

            try
            {
                if (File.Exists(filepath))
                {
                    FileStream fs = new FileStream(filepath, FileMode.Open);
                    cm = (ConfigManager)ser.Deserialize(fs);
                    fs.Close();
                } 
                else
                {
                    MessageBox.Show("Could not find User Configuration File\n\nCreating new file...", "User Config Not Found");

                    FileStream fs = new FileStream(filepath, FileMode.CreateNew);
                    TextWriter tw = new StreamWriter(fs);
                    ser.Serialize(tw, cm);
                    tw.Close();
                    fs.Close();
                }

                setupControlsFromConfig();

            }
            catch (Exception ex)
            {
                MessageBox.Show(ex.Message);
            }
  }

After it has been serialized, you can then call the parameters of your config file using cm.WindowTitle, etc.

0A0D 2009-12-21 19:07:01

Just to clarify: Serializable and NonSerialized don't have any effect on XmlSerializer; they're used only for System.Runtime.Serialization (e.g. binary serialisation). XmlSerializer serialises public fields and (read-write) properties, not internal state: no attribute needed on the class, and XmlIgnore instead of NonSerialized to exclude a field or property.

itowlson 2009-12-21 19:14:23

@itowlson: Correct. XML serialization uses reflection to generate special classes to perform the serialization.

0A0D 2009-12-21 19:17:07

It would help when reading the code if it was indented without the random size...

Lasse V. Karlsen 2009-12-23 20:00:45

@Lasse: Not sure what you mean, but if it is too hard to read then you can edit it

0A0D 2009-12-23 20:07:16

Answer 7

A:

Without knowing what your data looks like, i.e. the complexity, size, etc...XML is easy to maintain and easily accessible. I would NOT use an Access database, and flat files are more difficult to maintain over the long haul, particularly if you are dealing with more than one data field/element in your file.

I deal with large flat-file data feeds in good quantities daily, and even though an extreme example, flat-file data is much more difficult to maintain than the XML data feeds I process.

A simple example of loading XML data into a dataset using C#:

DataSet reportData = new DataSet();

reportData.ReadXml(fi.FullName);

You can also check out LINQ to XML as an option for querying the XML data...

HTH...

Tom Miller 2009-12-21 19:15:53

Answer 8

+1 A:

I have done several "stand alone" apps that have a local data store. I think the best thing to use would be SQL Server Compact Edition (formerly known as SQLAnywhere).

It's lightweight and free. Additionally, you can stick to writing a data access layer that is reusable in other projects plus if the app ever needs to scale to something bigger like full blown SQL server, you only need to change the connection string.

Loki Stormbringer 2009-12-21 19:17:15

Answer 9

A:

If you go the binary serialization route, Consider the speed at which a particular member of the datum needs to be accessed. If it is only a small collection, loading the whole file will make sense, but if it will be large, you might also consider an index file.

Tracking Account Properties/fields that are located at a specific address within the file can help you speed up access time, especially if you optimize that index file based on key usage. (possibly even when you write to disk.)

fauxtrot 2009-12-21 19:37:07

Answer 10

+2 A:

A lot of the answers in this thread attempt to overengineer the solution. If I'm correct, you just want to store user settings.

Use an .ini file or App.Config file for this.

If I'm wrong, and you are storing data that is more than just settings, use a flat text file in csv format. These are fast and easy without the overhead of XML. Folks like to poo poo these since they aren't as elegant, don't scale nicely and don't look as good on a resume, but it might be the best solution for you depending on what you need.

James 2009-12-21 19:37:50

app.config vs custom XML: http://stackoverflow.com/questions/1565898/app-config-vs-custom-xml-file

0A0D 2009-12-21 19:43:40

I'm doing something slightly more complex than just settings. Each user might have multiple 'accounts' associated with their name. The dictionary links this name (string) to the list of accounts associated with them. I'd be storing a bunch of accounts for each user. It could work with xml but i'm not quite sure how to go about it.

George 2009-12-21 19:55:03

In that case, I'd use the XmlSerializer class as mentioned. If you have a good grasp on OOP, it should be easy. Here's a good example: http://www.jonasjohn.de/snippets/csharp/xmlserializer-example.htm

James 2009-12-22 23:11:13

Answer 11

+3 A:

If your data is complex, high in quantity or you need to query it locally then object databases might be a valid option. I'd suggest looking at Db4o or Karvonite.

Goran 2009-12-21 19:40:06

Answer 12

+1 A:

All of the above are good answers, and generally solve the problem.

If you need an easy, free way to scale to millions of pieces of data, try out the ESENT Managed Interface project on CodePlex.

ESENT is an embeddable database storage engine (ISAM) which is part of Windows. It provides reliable, transacted, concurrent, high-performance data storage with row-level locking, write-ahead logging and snapshot isolation. This is a managed wrapper for the ESENT Win32 API.

It has a PersistentDictionary object that is quite easy to use. Think of it as a Dictionary() object, but it is automatically loaded from and saved to disk without extra code.

For example:

/// <summary>
/// Ask the user for their first name and see if we remember 
/// their last name.
/// </summary>
public static void Main()
{
    PersistentDictionary<string, string> dictionary = new PersistentDictionary<string, string>("Names");
    Console.WriteLine("What is your first name?");
    string firstName = Console.ReadLine();
    if (dictionary.ContainsKey(firstName))
    {
        Console.WriteLine("Welcome back {0} {1}", firstName, dictionary[firstName]);
    }
    else
    {
        Console.WriteLine("I don't know you, {0}. What is your last name?", firstName);
        dictionary[firstName] = Console.ReadLine();
    }

To answer George's question:

Supported Key Types

Only these types are supported as dictionary keys:

Boolean Byte Int16 UInt16 Int32 UInt32 Int64 UInt64 Float Double Guid DateTime TimeSpan String

Supported Value Types

Dictionary values can be any of the key types, Nullable versions of the key types, Uri, IPAddress or a serializable structure. A structure is only considered serializable if it meets all these criteria:

• The structure is marked as serializable • Every member of the struct is either: 1. A primitive data type (e.g. Int32) 2. A String, Uri or IPAddress 3. A serializable structure.

Or, to put it another way, a serializable structure cannot contain any references to a class object. This is done to preserve API consistency. Adding an object to a PersistentDictionary creates a copy of the object though serialization. Modifying the original object will not modify the copy, which would lead to confusing behavior. To avoid those problems the PersistentDictionary will only accept value types as values.

Can Be Serialized [Serializable] struct Good { public DateTime? Received; public string Name; public Decimal Price; public Uri Url; }

Can’t Be Serialized [Serializable] struct Bad { public byte[] Data; // arrays aren’t supported public Exception Error; // reference object }

GalacticJello 2009-12-21 19:41:56

This method is basically replacing the built in generic with a persistent dictionary. It's a pretty elegant solution, but how does it handle complex objects like in the OP example? Does it store everything inside the dict, or just the dict itself?

George 2009-12-21 19:57:22

This could fall short in the ultimate goal of trying to save the lists of type Account. The key's would be fine, but making the generic serializable could be hard :/.

George 2009-12-21 20:22:11

Answer 13

A:

Depending on the compelexity of your Account object, I would recomend either XML or Flat file.

If there are just a couple of values to store for each account, you could store them on a properties file, like this:

account.1.somekey=Some value
account.1.someotherkey=Some other value
account.1.somedate=2009-12-21
account.2.somekey=Some value 2
account.2.someotherkey=Some other value 2

... and so forth. Reading from a properties file should be easy, as it maps directly to a string dictionary.

As to where to store this file, the best choise would be to store into AppData folder, inside a subfolder for your program. This is a location where current users will always have access to write, and it's kept safe from other users by the OS itself.

Pablo 2009-12-21 19:44:39

Answer 14

+1 A:

I'd store the file as JSON. Since you're storing a dictionary which is just a name/value pair list then this is pretty much what json was designed for.
There a quite a few decent, free .NET json libraries - here's one but you can find a full list on the first link.

zebrabox 2009-12-21 20:19:27

This looks like a great option, I'll look into it more carefully.

George 2009-12-21 20:35:50

Answer 15

A:

Keep it simple - as you said, a flat file is sufficient. Use a flat file.

This is assuming that you have analyzed your requirements correctly. I would skip the serializing as XML step, overkill for a simple dictionary. Same thing for a database.

Larry Watanabe 2009-12-21 20:23:48

Answer 16

+1 A:

The first thing I'd look at is a database. However, serialization is an option. If you go for binary serialization, then I would avoid BinaryFormatter - it has a tendency to get angry between versions if you change fields etc. Xml via XmlSerialzier would be fine, and can be side-by-side compatible (i.e. with the same class definitions) with protobuf-net if you want to try contract-based binary serialization (giving you a flat file serializer without any effort).

Marc Gravell 2009-12-21 20:35:50

Answer 17

+1 A:

If your collection gets too big, I have found that Xml serialization gets quite slow. Another option to serialize your dictionary would be "roll your own" using a BinaryReader and BinaryWriter.

Here's some sample code just to get you started. You can make these generic extension methods to handle any type of Dictionary, and it works quite well, but is too verbose to post here.

 class Account
{
    public string AccountName { get; set; }
    public int AccountNumber { get; set; }

    internal void Serialize(BinaryWriter bw)
    {
        // Add logic to serialize everything you need here
        // Keep in synch with Deserialize
        bw.Write(AccountName);
        bw.Write(AccountNumber);
    }

    internal void Deserialize(BinaryReader br)
    {
        // Add logic to deserialize everythin you need here, 
        // Keep in synch with Serialize
        AccountName = br.ReadString();
        AccountNumber = br.ReadInt32();
    }
}


class Program
{
    static void Serialize(string OutputFile)
    {
        // Write to disk 
        using (Stream stream = File.Open(OutputFile, FileMode.Create))
        {
            BinaryWriter bw = new BinaryWriter(stream);
            // Save number of entries
            bw.Write(accounts.Count);

            foreach (KeyValuePair<string, List<Account>> accountKvp in accounts)
            {
                // Save each key/value pair
                bw.Write(accountKvp.Key);
                bw.Write(accountKvp.Value.Count);
                foreach (Account account in accountKvp.Value)
                {
                    account.Serialize(bw);
                }
            }
        }
    }

    static void Deserialize(string InputFile)
    {
        accounts.Clear();

        // Read from disk
        using (Stream stream = File.Open(InputFile, FileMode.Open))
        {
            BinaryReader br = new BinaryReader(stream);
            int entryCount = br.ReadInt32();
            for (int entries = 0; entries < entryCount; entries++)
            {
                // Read in the key-value pairs
                string key = br.ReadString();
                int accountCount = br.ReadInt32();
                List<Account> accountList = new List<Account>();
                for (int i = 0; i < accountCount; i++)
                {
                    Account account = new Account();
                    account.Deserialize(br);
                    accountList.Add(account);
                }
                accounts.Add(key, accountList);
            }
        }
    }

    static Dictionary<string, List<Account>> accounts = new Dictionary<string, List<Account>>();

    static void Main(string[] args)
    {
        string accountName = "Bob";
        List<Account> newAccounts = new List<Account>();
        newAccounts.Add(AddAccount("A", 1));
        newAccounts.Add(AddAccount("B", 2));
        newAccounts.Add(AddAccount("C", 3));
        accounts.Add(accountName, newAccounts);

        accountName = "Tom";
        newAccounts = new List<Account>();
        newAccounts.Add(AddAccount("A1", 11));
        newAccounts.Add(AddAccount("B1", 22));
        newAccounts.Add(AddAccount("C1", 33));
        accounts.Add(accountName, newAccounts);

        string saveFile = @"C:\accounts.bin";

        Serialize(saveFile);

        // clear it out to prove it works
        accounts.Clear();

        Deserialize(saveFile);
    }

    static Account AddAccount(string AccountName, int AccountNumber)
    {
        Account account = new Account();
        account.AccountName = AccountName;
        account.AccountNumber = AccountNumber;
        return account;
    }
}

GalacticJello 2009-12-21 20:52:27

Thanks, this looks like the best solution so far. What do you mean by keep in sync with Deserialize/Serialize? As in updating the file when its modified? This function will only be used on application start and exit to save the dict, so could you please clarify that?Otherwise thanks a lot.

George 2009-12-22 01:59:09

After thinking about it for a bit, I've realised it means that the logic for serialising and deserialising should be the same. That is all.

George 2009-12-22 14:48:45

Yes, that's all it means. So if you add another property to serialize/deserialize, just keep in mind that you have to add code to both the Serialize/Deserialize method, and keep them in the same order. A bit of maintenence, but the performance over Xml serialization is no comparison (several minutes to deserialize using xml, to a couple seconds using BinaryReader, with a few hundred thousand dictionary items).

GalacticJello 2009-12-22 16:08:35

Answer 18

+2 A:

Just finished coding data storage for my current project. Here is my 5 cents.

I started with binary serialization. It was slow (about 30 sec for load of 100,000 objects) and it was creating a pretty big file on the disk as well. However, it took me a few lines of code to implement and I got my all storage needs covered. To get better performance I moved on custom serialization. Found FastSerialization framework by Tim Haynes on Code Project. Indeed it is a few times faster (got 12 sec for load, 8 sec for save, 100K records) and it takes less disk space. The framework is built on the technique outlined by GalacticJello in a previous post.

Then I moved to SQLite and was able to get 2 sometimes 3 times faster performance – 6 sec for load and 4 sec for save, 100K records. It includes parsing ADO.NET tables to application types. It also gave me much smaller file on the disk. This article explains how to get best performance out of ADO.NET: http://sqlite.phxsoftware.com/forums/t/134.aspx. Generating INSERT statements is a very bad idea. You can guess how I came to know about that. :) Indeed, SQLite implementation took me quite a bit of time plus careful measurement of time taking by pretty much every line of the code.

ACH 2009-12-23 20:34:23

ansaurus

tags:

views:

answers:

Best way to store data locally in .NET (C#)

related questions