After exploring several different ways to pass web data to a database for update purposes, I'm wondering if XML might be a good strategy. The database is currently SQL 2000. In a few months it will move to SQL 2005 and I will be able to change things if needed, but I need a SQL 2000 solution now.

First of all, the database in question uses the EAV model. I know that this kind of database is generally highly frowned on, so for the purposes of this question, please just accept that this is not going to change.

The current update method has the web server inserting values (first converted to their correct underlying types, then to sql_variant) into a temp table. A stored procedure, which expects the temp table to exist, is then run; it takes care of updating, inserting, or deleting things as needed.
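
To illustrate, the current handoff looks roughly like this (a sketch only; the real table and procedure names are different):

CREATE TABLE #ElementUpdate (
   ElementID int NOT NULL,
   AttrID    int NOT NULL,
   Value     sql_variant NULL  -- already converted to the correct underlying type
)

-- the web server inserts the converted values, then:
EXEC dbo.ElementPost  -- expects #ElementUpdate to exist; updates/inserts/deletes as needed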

So far, only a single element has needed to be updated at a time. But now, there is a requirement to be able to edit multiple elements at once, and also to support hierarchical elements, each of which can have its own list of attributes. Here's some example XML I hand-typed to demonstrate what I'm thinking of.

Note that in this database the Entity is Element and an ID of 0 signifies "create" aka an insert of a new item.

<Elements>
  <Element ID="1234">
    <Attr ID="221">Value</Attr>
    <Attr ID="225">287</Attr>
    <Attr ID="234">
      <Element ID="99825">
        <Attr ID="7">Value1</Attr>
        <Attr ID="8">Value2</Attr>
        <Attr ID="9" Action="delete" />
      </Element>
      <Element ID="99826" Action="delete" />
      <Element ID="0" Type="24">
        <Attr ID="7">Value4</Attr>
        <Attr ID="8">Value5</Attr>
        <Attr ID="9">Value6</Attr>
      </Element>
      <Element ID="0" Type="24">
        <Attr ID="7">Value7</Attr>
        <Attr ID="8">Value8</Attr>
        <Attr ID="9">Value9</Attr>
      </Element>
    </Attr>
    <Rel ID="3827" Action="delete" />
    <Rel ID="2284" Role="parent">
      <Element ID="3827" />
      <Element ID="3829" />
      <Attr ID="665">1</Attr>
    </Rel>
    <Rel ID="0" Type="23" Role="child">
      <Element ID="3830" />
      <Attr ID="67"
    </Rel>
  </Element>
  <Element ID="0" Type="87">
    <Attr ID="221">Value</Attr>
    <Attr ID="225">569</Attr>
    <Attr ID="234">
      <Element ID="0" Type="24">
        <Attr ID="7">Value10</Attr>
        <Attr ID="8">Value11</Attr>
        <Attr ID="9">Value12</Attr>
      </Element>
    </Attr>
  </Element>
  <Element ID="1235" Action="delete" />
</Elements>

Some Attributes are straight value types, such as AttrID 221. But AttrID 234 is a special "multi-value" type that can have a list of elements underneath it, and each one can have one or more values. Types only need to be provided when a new item is created, since the ElementID fully implies the type of an existing item. I'll probably support passing in only changed items (as detected by JavaScript). And there may be an Action="delete" on Attr elements as well, since NULLs are treated as "unselected"--sometimes it's very important to know whether a Yes/No question has intentionally been answered No or whether no one's bothered to say Yes yet.
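
Since I need a SQL 2000 solution first, the shredding on the database side would go through OPENXML rather than the XML data type. Here is a minimal sketch against the example above (the column patterns are just my first guess; it only reads top-level Attr values, and the nested multi-value elements would need extra passes):

DECLARE @hdoc int
EXEC sp_xml_preparedocument @hdoc OUTPUT, @xml  -- @xml holds the posted document, e.g. an ntext parameter

SELECT ElementID, AttrID, [Action], Value
FROM
   OPENXML(@hdoc, '/Elements/Element/Attr', 1)
   WITH (
      ElementID int            '../@ID',
      AttrID    int            '@ID',
      [Action]  varchar(10)    '@Action',
      Value     nvarchar(4000) 'text()'  -- the 10k+ strings would need ntext here
   )

EXEC sp_xml_removedocument @hdoc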

There is also a different kind of data: a Relationship. At this time, those are updated through individual AJAX calls as things are edited in the UI, but I'd like to include them here so that changes to relationships can be canceled (right now, once you change one, it's done). So those are really elements too, but they are called Rel instead of Element. Relationships are implemented as ElementID1 and ElementID2, so RelID 2284 in the XML above is stored in the database as:

ElementID = 2284, ElementID1 = 1234, ElementID2 = 3827
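
So the implied storage shape is roughly this (my sketch; the actual table and column names may differ):

-- a Rel is itself an element row, plus two columns linking its endpoints
CREATE TABLE Relationship (
   ElementID  int NOT NULL,  -- the Rel's own element ID (2284 above)
   ElementID1 int NOT NULL,  -- one end of the relationship (1234 above)
   ElementID2 int NOT NULL   -- the other end (3827 above)
)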

Having multiple children in one relationship isn't currently supported, but it would be nice later.

Does this strategy and the example XML make sense? Is there a more sensible way? I'm just looking for some broad critique to help save me from going down a bad path. Any aspect that you'd like to comment on would be helpful.

The web language happens to be Classic ASP, but that could change to ASP.NET at some point. A persistence engine like LINQ or NHibernate is probably not acceptable right now--I just want to get this already-working application enhanced without a huge amount of development time.

I'll choose the answer that shows experience and has a balance of good warnings about what not to do, confirmations of what I'm planning to do, and recommendations about something else to do. I'll make it as objective as possible.

P.S. I'd like to handle Unicode characters as well as very long strings (10k+).

UPDATE

I have had this working for some time and I used the ADO Recordset Save-To-Stream trick to make creating the XML really easy. The result seems to be fairly fast, though if speed ever becomes a problem I may revisit this.

In the meantime, my code works to handle any number of elements and attributes on the page at once, including updating, deleting, and creating new items all in one go.

I settled on a scheme like so for all my elements:

  • Existing data elements

    Example: input name e12345_a678 (element 12345, attribute 678); the input value is the value of the attribute.

  • New elements

    JavaScript copies a hidden template of the set of HTML elements needed for the type into the correct location on the page, increments a counter to get a new ID for this item, and prepends the number to the names of the form items.

    var newid = 0;  // running counter for new (not yet saved) items on the page
    
    
    // Append a hidden input carrying one piece of metadata for a new item.
    function metadataAdd(reference, nameid, value) {
       var t = document.createElement('input');
       t.setAttribute('name', nameid);
       t.setAttribute('id', nameid);
       t.setAttribute('type', 'hidden');
       t.setAttribute('value', value);
       reference.appendChild(t);
    }
    
    
    // Copy the hidden template for this attribute into place, giving every
    // form item in it a unique n{id}_ prefix, plus the hidden metadata inputs.
    function multiAdd(target, parentelementid, attrid, elementtypeid) {
       var proto = document.getElementById('a' + attrid + '_proto');
       var instance = document.createElement('p');
       target.parentNode.parentNode.insertBefore(instance, target.parentNode);
       var thisid = ++newid;
       instance.innerHTML = proto.innerHTML.replace(/{prefix}/g, 'n' + thisid + '_');
       instance.id = 'n' + thisid;
       instance.className += ' new';
       metadataAdd(instance, 'n' + thisid + '_p', parentelementid);
       metadataAdd(instance, 'n' + thisid + '_c', attrid);
       metadataAdd(instance, 'n' + thisid + '_t', elementtypeid);
       return false;
    }
    

    Example: Template input name _a678 becomes n1_a678 (a new element, the first one on the page, attribute 678). All attributes of this new element are tagged with the same n1 prefix. The next new item will be n2, and so on. Some hidden form inputs are also created:

    n1_t, value is the element type of the element to be created
    n1_p, value is the parent id of the element (if it is a relationship)
    n1_c, value is the child id of the element (if it is a relationship)

  • Deleting elements

    A hidden input named e12345_t is created with its value set to 0. The existing controls displaying that attribute's values are disabled so they are not included in the form post. So "set type to 0" is treated as delete.

With this scheme, every item on the page has a unique name and can be distinguished properly, and every action can be represented properly.

When the form is posted, here's a sample of building one of the two recordsets used (Classic ASP code):

Set Data = Server.CreateObject("ADODB.Recordset")
Data.Fields.Append "ElementID", adInteger, 4, adFldKeyColumn
Data.Fields.Append "AttrID", adInteger, 4, adFldKeyColumn
Data.Fields.Append "Value", adLongVarWChar, 2147483647, adFldIsNullable Or adFldMayBeNull ' wide character type covers Unicode and very long strings
Data.CursorLocation = adUseClient
Data.CursorType = adOpenDynamic
Data.Open ' opened with no connection: a disconnected, client-side recordset

This is the recordset for the values; the other, for the elements themselves, has ElementID, ElementTypeID, ElementID1, and ElementID2 fields (matching the OPENXML shredding in the stored procedure below).

I step through the posted form. For the element recordset, I use a Scripting.Dictionary populated with instances of a custom Class that has the properties I need, so that I can add the values piecemeal, since they don't always come in order. New elements are added with negative IDs to distinguish them from existing elements (rather than requiring a separate column to indicate whether a row is new or addresses an existing element). I use a regular expression to tear apart the form keys: "^(e|n)([0-9]{1,10})_(a|p|t|c)([0-9]{0,10})$"

Then, adding an attribute looks like this.

Data.AddNew
ElementID.Value = DataID
AttrID.Value = Integerize(Matches(0).SubMatches(3))
AttrValue.Value = Request.Form(Key)
Data.Update

ElementID, AttrID, and AttrValue are references to the fields of the recordset, obtained once before the loop (e.g. Set ElementID = Data.Fields("ElementID")). This method is hugely faster than evaluating Data.Fields("ElementID").Value each time.

I loop through the Dictionary of element updates and ignore any that don't have all the proper information, adding the good ones to the recordset.

Then I call my data-updating stored procedure like so:

Set Cmd = Server.CreateObject("ADODB.Command")
With Cmd
   Set .ActiveConnection = MyDBConn
   .CommandType = adCmdStoredProc
   .CommandText = "DataPost"
   .Prepared = False
   .Parameters.Append .CreateParameter("@ElementMetadata", adLongVarWChar, adParamInput, 2147483647, XMLFromRecordset(Element))
   .Parameters.Append .CreateParameter("@ElementData", adLongVarWChar, adParamInput, 2147483647, XMLFromRecordset(Data))
End With
Result.Open Cmd ' previously created recordset object with options set

Here's the function that does the XML conversion:

Private Function XMLFromRecordset(Recordset)
   Dim Stream
   Set Stream = Server.CreateObject("ADODB.Stream")
   Stream.Open
   Recordset.Save Stream, adPersistXML ' serialize the recordset in ADO's XML rowset format
   Stream.Position = 0
   XMLFromRecordset = Stream.ReadText
   Stream.Close
End Function

Just in case the web page needs to know, the SP returns a recordset of any new elements, showing their page value and their created value (so I can see that n1 is now e12346, for example).

Here are some key snippets from the stored procedure. Note this is SQL 2000 for now, though I'll be able to switch to 2005 soon:

CREATE PROCEDURE [dbo].[DataPost]
   @ElementMetaData ntext,
   @ElementData ntext
AS
DECLARE @hdoc int

--- snip ---

EXEC sp_xml_preparedocument @hdoc OUTPUT, @ElementMetaData, '<xml xmlns:s="uuid:BDC6E3F0-6DA3-11d1-A2A3-00AA00C14882" xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" xmlns:rs="urn:schemas-microsoft-com:rowset" xmlns:z="#RowsetSchema" />'
INSERT #ElementMetadata (ElementID, ElementTypeID, ElementID1, ElementID2)
SELECT *
FROM
   OPENXML(@hdoc, '/xml/rs:data/rs:insert/z:row', 0)
   WITH (
      ElementID int,
      ElementTypeID int,
      ElementID1 int,
      ElementID2 int
   )
ORDER BY ElementID -- orders negative items (new elements) first so they begin counting at 1 for later ID calculation

EXEC sp_xml_removedocument @hdoc

--- snip ---

UPDATE E
SET E.ElementTypeID = M.ElementTypeID
FROM
   Element E
   INNER JOIN #ElementMetadata M ON E.ElementID = M.ElementID
WHERE
   E.ElementID >= 1
   AND M.ElementTypeID >= 1

The following query correlates the negative new element IDs with the newly inserted identity values:

UPDATE #ElementMetadata -- Correlate the new ElementIDs with the input rows
SET NewElementID = Scope_Identity() - @@RowCount + DataID
WHERE ElementID < 0
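
To make the arithmetic concrete (my own worked example): the new rows were just inserted into Element in DataID order, so if 3 new elements were inserted and Scope_Identity() came back as 12348, then @@RowCount is 3 and:

-- DataID 1 (the first new row inserted) gets 12348 - 3 + 1 = 12346
-- DataID 2 gets 12348 - 3 + 2 = 12347
-- DataID 3 gets 12348 - 3 + 3 = 12348
-- (this relies on the batch receiving contiguous identity values; concurrent
-- inserts into Element would break the correlation)

Returning the page-to-database ID map mentioned earlier is then a simple select (a sketch; my real column names differ slightly):

SELECT ElementID AS PageID, NewElementID  -- e.g. shows that n1 is now e12346
FROM #ElementMetadata
WHERE ElementID < 0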

Other set-based queries do the rest of the work: validating that the attributes are allowed and of the correct data type, and inserting, updating, and deleting elements and attributes.
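
For instance, one of the validation passes looks conceptually like this (a sketch only; #ElementData, the values temp table, and ElementTypeAttr, an "allowed attributes per type" table, are names I'm making up here):

-- discard posted attribute rows whose AttrID is not legal for the element's type
-- (shown for existing elements; new, negative-ID elements would be checked
-- against #ElementMetadata instead)
DELETE D
FROM
   #ElementData D
   INNER JOIN Element E ON E.ElementID = D.ElementID
   LEFT JOIN ElementTypeAttr A
      ON A.ElementTypeID = E.ElementTypeID
      AND A.AttrID = D.AttrID
WHERE A.AttrID IS NULL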

I hope this brief run-down is useful to others some day! Converting ADO Recordsets to an XML stream was a huge winner for me, as it saved all sorts of time and came with a namespace and schema already defined, which made the results come out correctly.

Using a flatter XML format with two inputs was also much easier than sticking to some ideal of having everything in a single XML stream.

A: 

I don't see any reason not to use XML columns in SQL Server 2005, and to do all your work via stored procedures.

You probably don't have time to abstract your data access to hide the ugliness of the data model, so why not just access it as-is, using XML? You can use XQuery in SQL Server to do updates, queries, etc.

Now that I think of it, you might still put one layer of abstraction between the ASP pages and the database. That would allow you in the future to use XSLT to transform the structure of your XML into a format that will perform better in the database.

John Saunders
What do you mean by "XML columns"? Some brief examples of what you're talking about would be good. I don't mean working code; I'm just having trouble connecting what you're saying to the real world. For example, what format would perform better in the database? Note that the question here isn't about reading the database--that works fine. It's about writing back to the database. Though I suppose if multiple requests work for reading, then multiple requests for writing could be okay. It's just not so simple to submit entire recordsets as parameters to SPs.
Emtucifor
@Emtucifor: SQL Server 2005 has a new column data type, XML. See http://technet.microsoft.com/en-us/library/ms190936%28SQL.90%29.aspx. Sorry, I thought you mentioned SQL Server 2005 because you knew about XML columns.
John Saunders
I do know about XML columns, but I don't think actually storing the data as XML is the right route here. Were you suggesting changing the database structure to use XML columns? Or were you instead suggesting the web application insert into an XML column and then do its updates from there? Or am I missing that an input parameter can be of the XML data type?
Emtucifor
@John Saunders: Any thoughts on my last comment?
Emtucifor
@Emtucifor: I had been suggesting XML columns. But since you're stuck on SQL Server 2000 for a while, given that you're using Classic ASP, and given what @Remus said about severity 16, I'm not so sure anymore.
John Saunders
+3  A: 

If I understand correctly, you are interested in the pros and cons of using XML as the data format between the database and the application (in this case, a web app).

If you happen to have the entire data to be inserted/updated/deleted as a handy bag of data in your client, then actually sending it as XML makes sense. One simple reason is that this allows a single database round-trip to the server, and reducing round-trips is always a good thing. But the most important advantage is that you can employ the holy grail of database performance: set-oriented processing. Using the XML methods, especially nodes() and value(), combined with some moderate XPath-fu, you can shred the entire XML parameter received from the application into relational sets and use set-oriented operations to do the database writes.

Take for instance the XML in your post; let's say it was passed as a parameter named @x of type XML. You can shred it into the attributes to be merged into existing elements:

select x.value(N'@ID', N'int') as ID,
  x.value(N'.', N'varchar(max)') as [Value]
from  @x.nodes('//Element[not(@Action="delete") and not (@ID=0)]/Attr') t(x)

You can shred the attributes that go into new elements:

select x.value(N'@ID', N'int') as ID,
  x.value(N'.', N'varchar(max)') as [Value]
from  @x.nodes('//Element[@ID=0]/Attr') t(x);

And you can shred the elements to be deleted:

select x.value(N'@ID', N'int') as ID
from  @x.nodes('//Element[@Action="delete"]') t(x);

These sets can be manipulated via normal SQL DML: inserted, deleted, updated, or merged into the EAV tables, in one single pass. Note that the XML shreds I show here are trivial and probably incorrect for your case; they are just to show the way to do it.
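
For example, merging the first shred into a hypothetical EAV value table could look like this (the table and column names are made up; note the (../@ID)[1] step to pick up the owning element):

update EAV
set EAV.[Value] = S.[Value]
from dbo.ElementAttrValue as EAV  -- hypothetical EAV storage table
join (
  select x.value(N'(../@ID)[1]', N'int') as ElementID,
    x.value(N'@ID', N'int') as AttrID,
    x.value(N'.', N'varchar(max)') as [Value]
  from @x.nodes(N'//Element[not(@Action="delete") and not (@ID=0)]/Attr') t(x)
) as S
  on S.ElementID = EAV.ElementID
  and S.AttrID = EAV.AttrID;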

Now whether this is the best path to go, I don't know. There are way too many variables and moving pieces, and they lie mostly in your dev team's skill set and existing code base. For sure, XML is a good format for calling into the database to update sets, but XML has its shortcomings too: it is verbose and fat, it is slower to parse than binary formats, and it is actually quite difficult for programmers to fully grok: once you get past the sugar coat of '<' and '>', there is a deep (and sometimes messy) layer of XPath, XQuery, namespaces, encodings, CDATA, and the rest.

I'd say go ahead, prototype, let us know how it goes...

Remus Rusanu
Remus, all the reasons you mention are the exact ones I was thinking about! I do have all the data at once and can stuff it into a property bag, as you say. And exactly so, I can query the XML repeatedly to extract different data sets and then do my joins (row-by-row is truly evil). In the past I've not been impressed with XML because of how verbose and slow it is. And I also have my reservations about all the extra gunk you mentioned like namespaces, encodings, and CDATA (I want to be able to handle any Unicode character properly).
Emtucifor
So... given that you jumped right in and gave sample queries, it seems you sort of approve of the XML route? Of course, for now I am stuck with SQL 2000. Do you have any other thoughts on the format of my XML (e.g., are Action="delete" and the mix of element-based and attribute-based data sensible) or on any other good ways to hand in an entire property bag? Last night I realized that all the things I'm updating share a fairly standard representation, and if I add ElementID1 and ElementID2 to the existing update process, I could do without XML.
Emtucifor
Also, I have concerns about creating the XML in the first place. I've found the MSXML libraries to be painfully slow, even just to build XML, and I've done the (gasp! horror!) workaround of concatenating the XML together manually in the past. But that only works when it's fairly simple, and this looks like it won't be so simple.
Emtucifor
I do not disapprove of XML, but I can't recommend it either since, as you are clearly aware, there are just too many moving pieces. But I will tell you a short story from my professional history. I deal a lot with Service Broker based systems, and in SSB the norm is to get the data as XML: dequeue a message from a queue, then start processing the XML. You can see on my blog at http://rusanu.com/2006/10/16/writing-service-broker-procedures/ how various styles of processing the payload impact performance. Set-oriented XML processing is pretty much the fastest way.
Remus Rusanu
However, I am finding myself *removing* set-oriented processing and replacing it with cursor-based, row-by-row processing ('row' extracted from the XML). The reason for this is complexity. Maintaining the set-oriented procedures is just hard to do; projects take longer than needed and have more bugs, simply because it is hard to wrap one's head around the complex XPath queries involved. And a major problem is error handling: it is hard to detect whether *any* XML element was simply not processed (i.e., skipped by all the XPath queries).
Remus Rusanu
@Remus: are you using XML Schema with the XML data? SQL Server is capable of validating the XML against a schema.
John Saunders
@John: No, I don't use an XML schema. Schema validation wouldn't help me, though. The problem is more one of defensive code style: the code works fine with all the known cases and passes the functional test cases, but in production you want to be able to detect *new* cases (they always happen eventually) and trigger the alarm, yet allow the processing to continue. I'd rather rely on a code path (i.e., a switch default case) than on XML schemas for this. A second reason is the severity 16 that XML errors are raised with; see http://rusanu.com/2007/10/31/error-handling-in-service-broker-procedures/
Remus Rusanu
I may go with XML, as I've figured out a nifty way to get my data into an XML object: build an ADO Recordset and turn it into a Stream! This will be much simpler for me than using MSXML and won't require installing MSXML either.
Emtucifor
@Remus: I didn't know about severity 16. Thanks for that.
John Saunders