Regex Base URL Grabbing
  • Time: 2012-04-26 02:09:44
  • Tags: regex

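I'm trying to filter URLs and find their base domain, stripping www or any other prefix. I'm having trouble writing an expression to capture it, and with the variety of TLDs it has become quite a complex problem.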

answers.yahoo.com => yahoo.com
www.google.com => google.com
uk.answers.yahoo.co.uk => yahoo.co.uk
www.g.se => g.se

Any suggestions?

I used the expression below, but it gives the wrong result when the domain name is two characters or shorter.

(?P<domain>[a-z0-9][a-z0-9-]{1,63}\.[a-z.]{2,6})$
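For reference, here is a minimal sketch of how that expression behaves on the four sample hosts. It assumes the dots in the pattern were originally escaped (`\.`) and that Python's `re` module is used; `grab` is a hypothetical helper name, not something from the question.

```python
import re

# The question's pattern, under the assumption that the literal dot was escaped.
ASKER_RE = re.compile(r"(?P<domain>[a-z0-9][a-z0-9-]{1,63}\.[a-z.]{2,6})$")

def grab(host):
    """Return the text captured by the 'domain' group, or None if nothing matches."""
    match = ASKER_RE.search(host)
    return match.group("domain") if match else None

for host in ["answers.yahoo.com", "www.google.com",
             "uk.answers.yahoo.co.uk", "www.g.se"]:
    print(host, "=>", grab(host))
```

Under that assumption the first three hosts come out as wanted, but www.g.se is returned whole as www.g.se. The first part of the pattern needs at least two characters before the dot, so it cannot start at the one-letter label g, and `[a-z.]{2,6}` is loose enough to swallow g.se as if it were the TLD.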
Best answer

How do you know that the base of uk.answers.yahoo.co.uk is yahoo.co.uk, while the base of, say, f.bar.maps.google.com is not maps.google.com?
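That distinction cannot be read off the string alone; something has to say that co.uk is a suffix and google.com is not. A rough sketch of the idea in Python, without any regex: keep a table of known suffixes and return exactly one label in front of the longest suffix that matches. The `SUFFIXES` set and the function name `base_domain_by_suffix` are hypothetical and deliberately tiny. In practice such a table is usually taken from a maintained source such as the Public Suffix List.

```python
# Hypothetical, deliberately tiny suffix table; a real one would be much longer.
SUFFIXES = {"com", "se", "co.uk"}

def base_domain_by_suffix(host):
    """Return one label plus the longest known suffix, or None if no suffix is known."""
    labels = host.lower().strip(".").split(".")
    # Drop one leading label at a time, so the longest candidate suffix is tried first.
    for i in range(1, len(labels)):
        suffix = ".".join(labels[i:])
        if suffix in SUFFIXES:
            # Keep exactly one label in front of the matched suffix.
            return ".".join(labels[i - 1:])
    return None
```

With this table, uk.answers.yahoo.co.uk gives yahoo.co.uk and f.bar.maps.google.com gives google.com, because co.uk is listed as a suffix while google.com is not.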

Answers
[^.]*\.(?:co\.uk|\w{2,3})$

You will need to add the other known suffixes to the alternation in the regex.

http://regexr.com?30p4r
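A minimal sketch of how that pattern could be extended, assuming Python's `re` module: the alternation is built from a list of suffixes instead of being written by hand. The list `KNOWN_SUFFIXES` and the names `build_pattern` and `base_domain` are hypothetical, and the sketch uses `[^.]+` rather than `[^.]*` so that an empty label is not accepted.

```python
import re

# Hypothetical list of multi-part suffixes; extend it with the ones you need.
KNOWN_SUFFIXES = ["co.uk", "com.au", "co.jp"]

def build_pattern(suffixes):
    """Compile label + (known suffix | 2-3 word characters), anchored at the end."""
    # Longest suffix first, so a longer alternative is tried before a shorter one.
    ordered = sorted(suffixes, key=len, reverse=True)
    alternatives = "|".join(re.escape(s) for s in ordered)
    return re.compile(r"[^.]+\.(?:" + alternatives + r"|\w{2,3})$")

BASE_RE = build_pattern(KNOWN_SUFFIXES)

def base_domain(host):
    """Return the matched base domain, or None if the host does not match."""
    match = BASE_RE.search(host.lower())
    return match.group(0) if match else None

for host in ["answers.yahoo.com", "www.google.com",
             "uk.answers.yahoo.co.uk", "www.g.se"]:
    print(host, "=>", base_domain(host))
```

On the four hosts from the question this prints yahoo.com, google.com, yahoo.co.uk and g.se. Any multi-part suffix missing from the list still falls through to the `\w{2,3}` branch, so the result is only as good as the list.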




