extraction

What code libraries are there to rip audio CDs that are suitable for commerical use?

What code libraries are there to rip audio CDs (e.g. DAE, Digital Audio Extraction)? Must be suitable for commerical use (e.g., no GPL) Preferably free. Linux or FreeBSD or Windows. Most interested in libraries that run on unix-like OS's, especially FreeBSD. ...

Mailing Address tokenization/elementization to individual components (street, city, etc.)

I need to parse international addresses to its individual components (street, city, etc.). After some reasonable survey, I found that HMMs/CRFs are the way to go. Has anybody had any success using open source implementation of HMMs or CRFs for the address tokenization problem. If yes, what are they? Also, do any implementations provide...

Programmatically extract keywords from domain names

Let's say I have a list of domain names that I would like to analyze. Unless the domain name is hyphenated, I don't see a particularly easy way to "extract" the keywords used in the domain. Yet I see it done on sites such as DomainTools.com, Estibot.com, etc. For example: ilikecheese.com becomes "i like cheese" sanfranciscohotels.com be...

Is it possible to get any database's elements as a set queriable classes with EF?

Hello, I'm trying to make an application in which both the developer and the end user are allowed retrieve data from a relational DB (chosen at runtime) without writing any SQL code at all (and preferably no string concatenations when interacting with the database). The purpose of the application is to two things: a GUI to an expression...

TCP flow extraction

Hello My Friends, I need to extract TCP Flows with their content from dump file and then save their flow into other file each flow separately, does any one know a tool for processing this? I really appreciate for any Help Hanieh Rajabi. ...

Simplehtmldom - curl, loops, arrays?

Hello. Pse forgive what is most likely a stupid question. I've successfully managed to follow the simplehtmldom examples and get data that I want off one webpage. I want to be able to set the function to go through all html pages in a directory and extract the data. I've googled and googled but now I'm confused as I had in my ignora...

How to extract attachments from a ZFO?

Does anyone know how to extract the attachments from a .zfo file (602 form filler). Its a format based on openXML. Thank you ...

Extracting caption from alt attribute and inserting into a div

I am using the rather lovely jQuery slideviewer 1.1 plugin on a site at the moment, but would like to extract the alt attribute from images displayed and insert them into a div at the appropriate time. The current code for the plugin is shown below for reference: jQuery(function(){ jQuery("div.svw").prepend("<img src='/template/them...

How can I extract the first paragraph of a PDF document using Perl's CAM::PDF?

How can I extract the first paragraph of a PDF document using Perl's CAM::PDF? ...

Third party decompression on Blackberry

I know the BB only handles GZip and ZLib compression/decompression. my app gets essentially a zip file containing a large number of possible file types (txt, doc, html, wmv, png, gif, etc)...decrypts and displays the file a user selects from a tree. Using these decompression algorithms is not a good way to go having to decompress/decrypt...

efficient TIFF tile extraction C++

I am working with 1gb large tiff images of around 20000 x 20000 pixels. I need to extract several tiles (of about 300x300 pixels) out of the images, in random positions. I tried the following solutions: Libtiff (the only low level library I could find) offers TIFFReadline() but that means reading in around 19700 unnecesary pixels. I i...

Microsoft research on fact extraction from the web?

Some time ago I had accidentally found website which was apparently product of Microsoft research on fact extraction from the web, more specifically from the Wikipedia. Right now I would like to have better look into how it works, but the problem is i can't find it, maybe somebody knows what I am talking about and could give me direction...

What Is The Best Python Zip Module To Handle Large Files?

EDIT: Specifically compression and extraction speeds. Any Suggestions? Thanks ...

Extracting information from PDFs of research papers

I need a mechanism for extracting bibliographic metadata from PDF documents, to save people entering it by hand or cut-and-pasting it. At the very least, the title and abstract. The list of authors and their affiliations would be good. Extracting out the references would be amazing. Ideally this would be an open source solution. The ...

Javascript function parameter names at time of call?

I know this is a really long shot, but I figure I'd ask: Is there a way to to find the names of the variables passed as parameters in a function call? Assuming I have: function test(tmp1, tmp2) { // ... } var a; var b; test(a, b); I'd like to get an array like so: [a, b]. Strings would also be acceptable: ["a", "b"]. I do not w...

How to parse/extract data from a mediawiki marked-up article via python

Source Mediawiki markup Right now I'm using a variety of regexes to "parse" the data in the mediawiki mark-up into lists/dictionaries, so that elements within the article can be used. This is hardly the best method, as the number of cases that have to be made are large. How would one parse an article's mediawiki markup into a variety...

Image Extraction From PDF Document using Adobe Acrobat 5 API and .NET

I have seen tons of articles all around the web but nobody describes exactly how to use Adobe Acrobat 5 API with .NET to automate Image Extraction From a lst of PDF documents. Can anyone please help ? ...

Extract string from between quotations

I want to extract information from user-inputted text. Imagine I input the following: SetVariables "a" "b" "c" How would I extract information between the first set of quotations? Then the second? Then the third? ...

Extract links from a webpage using lxml, xpath and python

I've got this xpath query: /html/body//tbody/tr[*]/td[*]/a[@title]/@href It extracts all the links with the title attribute - and gives the href in FireFox's Xpath checker add-on. However, I cannot seem to use it with lxml. from lxml import etree parsedPage = etree.HTML(page) # Create parse tree from valid page. hyperlinks = parsedP...

Extract Video frames in Android

Possible Duplicate: Getting frames from Video Image in Android Hi, I wanted to know if it is possible to extract frames from a running Video in Android? I need to extract frames at regular intervals and send them for further processing. Would someone be able to find an answer for me? Thanks, Abhi ...