ooxml

Problem using large binary segment in OOXML

System Description A plotting component that uses OOXML to generate a document. Plotting component consists of several parts. All parts are written in C++ as exe + dll's, with the exception of the interface to the OOXML document. The latter component is a COM component that was created in C#/.NET. The main reason for this is that the...

Printing an OOXML document without MS Word installed

Is there any way to print an OOXML document (.docx file) without having MS Word installed? It works nicely via the MS Word interface but I need to find a way to use it on servers where MS Word is not installed. I've been digging through the API and haven't found anything obvious so I'm inclined to believe there isn't a way. Is this the...

Word 2007, Open XML - embedding an image

Do you know what basic information MUST include a paragraph (<w:p/>) in document.xml inside a *.docx document, that specifies an image? I do know there must be: <a:blip r:embed="rId4" /> specifing the relationship id, but what else? It's very hard to find it in Google and experimenting with cutting out tags from a ready document or...

Create custom shape or connector in PowerPoint using code/script/definition

The shapes in in PowerPoint 2007 seem to be built in - is there a way to create 'user'/'custom' shapes? I wanted to create a custom connector, but knowing if you can create a custom shape, could be handy in the future. ...

What are some techniques for generating PowerPoint presentations without using Office Automation?

While it is possible to generate PowerPoint presentations automatically using Office Automation, this is not recommended for use on a server. How can you go about generating a PowerPoint presentation without using Office Automation? ...

Office Open XML (OOXML) Specification: Encryption

I am trying to understand how encrypted ("password protected") Office 2007 documents are bundled (specifically, Excel documents). I am experimenting with a known, password protected spread sheet. When I unzip the XLSX file, I encounter three entries: [6]DataSpaces (Directory) EncryptionInfo (File) EncryptedPackage (File) How is the ...

How do I save an Excel 2007 file in "OOXML" (xml text) so that I can modify it in code?

I made an Excel file with data on tab 2, and a chart on tab 1. This is for a web-portal where investors can download the excel document with ubber graphics and the like, but with their data. So, the 'simple' fix in my mind is to save the Excel document as "OOXML" and just replace the data items. However, it seems that the document is en...

.NET WinForms Editor Control that supports RTL\Hebrew and export to OOXML\PDF?

I'm interested in replacing an aging control we use in our project that is a simple HTML WYSIWYG editor. The main requirements i'm looking for are as follows: Support for RTL/Hebrew input. Support for custom numbering styles (not just 1. 2. 3., A. B. C., etc). Support fo exporting the content to OOXML (for Word 2007 inclusion) and PDF....

How do you open encrypted OOXML document in C#?

I want to open encrypted OOXML files, which are stored as streams inside a compound OLE file. Microsoft has an API for OOXML, (OOXML Format SDK 2.0, but it seems to lack a way to manipulate encrypted OOXML documents. So, this breaks into two questions. How do you open compound OLE documents in C#? I found this very similar question,...

How to grab text from word (docx) document in C#?

Hey all, I'm trying to get the plain text from a word document. Specifically, the xpath is giving me trouble. How do you select the tags? Here's the code I have. public static string TextDump(Package package) { StringBuilder builder = new StringBuilder(); XmlDocument xmlDoc = new XmlDocument(); xmlDoc.Load(package.GetP...

XLSX- how to get rid of the default namespace prefix x: ?

I'm generating XLSX spreadsheet using OOXML SDK, and I need to get rid of x: namespace prefix. How can I achieve this? using (SpreadsheetDocument doc = SpreadsheetDocument.Open("template.xlsx", true)) { //Save the shared string table part if (doc.WorkbookPart.GetPartsOfType().Count() > 0) ...

How can I query a Word docx in an ASP.NET app?

I would like to upload a Word 2007 or greater docx file to my web server and convert the table of contents to a simple xml structure. Doing this on the desktop with traditional VBA seems like it would have been easy. Looking at the WordprocessingML XML data used to create the docx file is confusing. Is there a way (without COM) to nav...

Different types of hyperlinks in Word document

I'm writing a small app the takes a path as input and then fetches every docx file in that path, replacing every hyperlink with a keyword. The strange thing is that I found two sorts of hyperlinks, the first one is from WordprocessingDocument ExternalRelationships and the other one is from finding every xml-node matching <w:insrText/> s...

How to extract plain text from a DOCX file using the new OOXML support in Apache POI 3.5?

On September 28, 2009 the Apache POI project released version 3.5 which officially supports the OOXML formats introduced in Office 2007, like DOCX and XLSX. Please provide a code sample for extracting a DOCX file's content in plain text, ignoring any styles or formatting. I am asking this because I have been unable to find any Apache P...

Simple Example of creating a PowerPoint file

I'm looking for some sample code on how to add an image and text & output it as a PowerPoint file. The host server does not have Office and I'm not allowed to install anything so the "Manipulating Excel 2007 and PowerPoint 2007 Files with the Open XML Format API " will not help me. Below is the code the output the PowerPoint. I just don'...

Where can I find good documentation on OpenXML

I'm looking to create powerpoint file in OpenXML. I'm looking to use the following features Bind bits of the page to an xml file. For example, one tag would be bound to the name field of an xml file. The plan is to simple drop the xml file in the right folder, zip up the template, and rename the file the proper extension and have it j...

How to identify and validate an OOXML file?

I need to be able to identify that a given file is an OOXML file based on the contents of the file, and not on the file's extension. OOXML files are really a collection of XML and text files in a zip container, which means that I cannot use the file's magic number as it will just indicate that it is a zip file. So what I'm really askin...

In PowerPoint 2007, how can I position a Callout's Tail programatically?

I'm looking at the DrawingML of a PowerPoint 2007 file and this is what it has for the Callout object's coordinates and geometry: <p:spPr> <a:xfrm> <a:off x="2819400" y="5181600"/> // X,Y Position of Callout Box <a:ext cx="609600" cy="457200"/> // Width,Height of Callout Box </a:xfrm> <a:prstGeom prst="wedgeR...

Open XML SDK v2.0 Performance issue when deleting a first row in 20,000+ rows Excel file

Do anyone come across a performance issue when deleting a first row in a 20,000+ rows Excel file using OpenXML SDK v2.0? I am using the delete row coding suggested in the Open XML SDK document. It takes me several minutes just to delete the first row using Open XML SDK, but it only takes just a second in Excel applicaton. I eventually ...

Schema/DTD of PowerPoint

I am looking for a way to add some meta-infromation to PowerPoint files. Specifically, I want to add annotation to bullets and slides. Are there any XML attributes which can be added to the elements (of the slideX.xml file) without invalidating it? In other words, where can I find the schema/DTD of the slideX.xml files? ...