Python: use regex to extract strings from text and put them into a list

Question

Welcome To Ask or Share your Answers For Others

Python: use regex to extract strings from text and put them into a list

1 Answer

深蓝 · Answer 1 · 2021-10-06T03:09:23+0000

Maybe it's because my regex-fu not as strong as some's but I'd do it in 2 stages and only use regex on the first.

import re

text_ = "some strings ( a ; b ; c ) some strings ( d ; e ; f ) and so on"

#extract anything bounded by parenthesis
pat1 =re.compile(r"(([^)]+))")
split1 = pat1.findall(text_)

def split(substr):
    """ dont need to be all fancy, split on ; after stripping the parenthesis """
    return [v.strip() for v in substr.lstrip("(").rstrip(")").split(";")]

result = [split(val) for val in split1]

print(result)

output:

[['a', 'b', 'c'], ['d', 'e', 'f']]

Alternatively, you can let the first regex exclude the parenthesis in the groups so that it simplifies your split function. Cleaner, same output.

pat1 =re.compile(r"(([^)]+))")
split1 = pat1.findall(text_)

def split(substr):
    """ dont need to be all fancy, split on ; after stripping the parenthesis """
    return [v.strip() for v in substr.split(";")]

Categories

Python: use regex to extract strings from text and put them into a list

Python: use regex to extract strings from text and put them into a list

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

output:

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags