Dear all:
I have encountered a problem that I need to import a huge XML (> 1Gb) into SQL Server 2008 daily. What I have now is a sample XML file and the XML schema of it. The XML schema is pretty complex which contains many custom defined simple type, and element with complex type such as:
<xs:element name="xxxx_url">
<xs:complexType>
<xs:simpleContent>
<xs:extension base="xs:anyURI">
<xs:attribute ref="target" use="optional"/>
<xs:attribute ref="abc" use="optional"/>
</xs:extension>
</xs:simpleContent>
</xs:complexType>
</xs:element>
After import, a WCF service will be implemented to retrieve the data stored in SQL Sever, something like search, retrieve etc (read-only operations).
The implementation steps I can think of are like:
- Define an object model according to the provided XSD (manually), the object model will be used for WCF service to return values.
- Define a database schema from the provided XSD (manually), the schema is estimated to have about 20 - 30 tables.
- Create a SSIS package to load XML into database daily.
- Create a WCF service which reads from database, populates data into the object model defined in step 1 and returns the object to service client.
The problem is that these steps involve lots of manual work. I have to research the XSD row by row, and transform it to object model and database schema mannualy.
I did some research that there're some automation tools to transform XSD into classes, and also transform XSD into database schema. But the classes transformed from XSD with the tool are pretty messed, and the transformation to schema is failed because it does not conform to MS dataset format.
I am wondering is there any good solution to this problem, to save a lot of manual work?
Any suggestion is appreciated !