extract

PHP How to extract part of given string?

Hi, I'm writing a search engine for my site and need to extract chunks of text with given keyword and few words around for the search result list. I ended with something like that: /** * This function return part of the original text with * the searched term and few words around the searched term * @param string $text Original text ...

Is there a server-side dom engine suitable for crawling?

I found a project, jaxer which embeds Firefox's JavaScript engine on the server side, so it can parse HTML server-side very well. But, this project seems dead. It is really helpful for crawling web pages to parse HTML & extract data. Is there some new technology useful for extracting information? ...

Using a batch file to extract the variable part of a string to use in renaming txt files

I'm using batch files to process text files with names of the form: CLL*1.txt, CLLM*2.txt located in a specific "download" folder. All files contain a string of the form: "File Reference : 0xxxx", where xxxx is a unique numerical identifier. I am trying, without much success, to use the following script to rename the file to CLL*xxxx...

extract specific string in NSString

hi, i think it is a simple question but i dont know how to solve it. i have a NSString that containes html content. i want to extract some tags. NSString *string=@"test some text <img src='http://www.xyz.com/a.jpg' > blah blah <a href='asdfg'>aaaa</a> bbbb cccc"; i want to take img & "a" tag into new string. then i will display it i...

How to copy data encrypted by dbms_obfuscation_toolkit.DESEncrypt

Hi, I have an Oracle (10.2.0.4) database table with a column which is encrypted by dbms_obfuscation_toolkit.DESEncrypt tool kit. Some of our data has been messed up by it getting re-encrypted with another key. I want to do some testing on this data to try and recover it. Therefore, I want to copy the data from our live system and into...

Regex: Extracting readable (non-code) text and URLs from HTML documents

I am creating an application that will take a URL as input, retrieve the page's html content off the web and extract everything that isn't contained in a tag. In other words, the textual content of the page, as seen by the visitor to that page. That includes 'masking' out everything encapsuled in <script></script>, <style></style> and <!...

How can I extract URLs from plain text in Perl?

I've seen some posts like this, but not exactly what I want to do. How can I extract and delete URL links, and then remove them from plain text. Example: "Hello!!, I love http://www.google.es". I want extract the "http://www.google.es", save it on a variable, and then, remove it from my text. Finally, the text has to be like that: ...

Extract metadata from old Word files (from 2.0 onwards)

Hi. I have to extract metadata from a lot (my small working sample has hundreds, the total will probably be thousands) of Microsoft Office files, mostly Word ones. These files Word versions go from Word 2.0 to Word 2007. I have to do it in .net 3.5 (using c#) and it's a local winforms application. I can extract metadata from the most...

Perl: Extract data from logs into a report

I have some logs in a directory: /test/report. The logs are named: a.log, b.log, c.log. The content of a.log is below: Input 1 ---- Number of records := 101 Input 2 --- Num of of records := 101 Input 3 --- Num of records := 101 The content of b.log is below: Input1 ------- Number of records := 88 Input 2 ----- Num of of rec...

PDF Text Extraction at hyperlink locations

Anybody know of a (FREE) SDK of some sort that can start text extraction at the point in the PDF Document where a hyperlink takes you to (within the same PDF document)? The links end up taking us to specific points on specific pages. More specifically we need a program that can parse a pdf document that holds questions and answers to a...

ExtractIconEx all icon sizes vb6 including 256

Hello guys, I need to extract all Icons sizes up to 256x256 from ico,exe,dll,icl files ExtractIconEx only extracts 32x32 and 16x16 icons thanks for helping ...

problem with using extract php function?

<?php function check($user_id, $topic_id){ $query = mysql_query("SELECT user_id, topic_id FROM sa where topic_id='$topic_id' and user_id='$user_id'"); if (mysql_num_rows($query)==1){ return 'you have already voted'; } else { $against = ' <li> <button type="submit" value=...

Read data from multiple rows PHP MySQL

I have a database to keep track of the services clients have. But need to be able to find all the services (on different rows) and display them in an HTML table. i.e. Service Company Status ------------------------------------- Service 1 Company 1 Complete Service 2 Company 2 Pending Service 3 Company 1 ...

JSoup - Select all comments

Hey, I want to select all comments from a document using JSoup. I would like to do something like this: for(Element e : doc.select("comment")) { System.out.println(e); } I have tried this: for (Element e : doc.getAllElements()) { if (e instanceof Comment) { } } But the following error occurs in eclipse "Incompatible condi...