Java Regex/run on groovy: how to extract all the captured blocks? | ansaurus

tags:

views:

46

answers:

1

Q:

Java Regex/run on groovy: how to extract all the captured blocks?

Hi, I'm trying to extract two blocks of code from a text file using Java regex. However, I can only extract the last block. Could some one point out what is is wrong with mycode?

thanks.

here it is

import java.util.regex.*;

INPUT_START_OR_BLANK_LINE = /(?:\A|\n\n)/
FOUR_SPACES_OR_TAB = /(?:[ ]{4}|\t)/
CODE = /.*\n+/
CODE_LINES = /(?:$FOUR_SPACES_OR_TAB$CODE)/
LOOKAHEAD_FOR_NON_CODE_LINE = /(?:(?=^[ ]{0,4}\S)|\Z)/

// this regular expression will find all of the consecutive code lines in a markdown file
// in a markdown file, if the line starts with a tab or at least 4 spaces, it's a code line
// slightly modified from one in markdownj
// see: http://github.com/myabc/markdownj/tree/master/src/java/com/petebevin/markdown/MarkdownProcessor.java
MARKDOWN_CODE_BLOCK = "(?m)" + 
                      "$INPUT_START_OR_BLANK_LINE" +
                      "($CODE_LINES+)" +
                      "$LOOKAHEAD_FOR_NON_CODE_LINE"

def text="""
  Normal paragraph ....


    first Code block begin
       all codes
    first code block end

    how about this line?

   that is not good but what we have are very important
for the purpose of text. yes, we are good.


    second Code block begin
       all codes
    second code block end
    how about this line?

Normal returns
"""

 Pattern p = Pattern.compile(MARKDOWN_CODE_BLOCK);
 Matcher m = p.matcher(text); 

while (m.find() == true){
 m.group().eachLine {println it}
}

the code was adopted from http://naleid.com/blog/2009/01/01/using-groovy-regular-expressions-to-parse-code-from-a-markdown-file/

A:

Pattern regex = Pattern.compile("((    |\t).*(\r\n|\r|\n))*");
Matcher regexMatcher = regex.matcher(subjectString);
while (regexMatcher.find()) {
    System.out.println(regexMatcher.group());
}

Mike Clark 2010-10-30 02:02:01

related questions

Java Time Zone is messed up

Eclipse on win64

Automate builds for Java RCP for deployment with JNLP

Why are professors or schools picking Java over C++ to teach to students?

Is there a real benefit of using J#?

Public/Popular Websites using JavaServer Faces

Why can't I use a try block around my super() call?

Accessing post variables using Java Servlets

Personal Linux web server

Is this really widening vs autoboxing?

How can I Java webstart multiple, dependent, native libraries?

Why can't I call toString() on a Java primitive?

How do I use Java to read from a file that is actively being written?

What code analysis tools do you use for your Java projects?

IllegalArgumentException or NullPointerException for a null parameter?

How do I configure and communicate with a serial port?

What is the best way to parse strings in Java

Getting started with a custom JXTA PeerGroup

Creating a custom button in Java

How to get started "writing" a code coverage tool?

Which Build-/Configuration Management Tool?

What is the difference between an int and an Integer in Java/C#?

What is the meaning of the type safety warning in certain Java generics casts?

How would you access Object properties from within an object method?

Converting CSV File to XML in Java