English 中文(简体)
我如何能够找到网站链接,如与雷管网。
原标题:How can I find website link like with regex
  • 时间:2010-08-01 11:25:44
  •  标签:
  • c#
  • regex

iii

我希望能找到网站链接,如:

www.yahoo.com
yahoo.com
http://www.yahoo.com
http://yahoo.com
yahoo.jp ( or any domain)
http://yahoo.fr

是否有办法追踪他们。

最佳回答

这条战火球的reg子应该能够做你想要的多数。 I m unsure about domain.tld,因为这非常模糊。

(?xi)

(                           # Capture 1: entire matched URL
  (?:
    [a-z][w-]+:                # URL protocol and colon
    (?:
      /{1,3}                        # 1-3 slashes
      |                             #   or
      [a-z0-9%]                     # Single letter or digit or  % 
                                    # (Trying not to match e.g. "URI::Escape")
    )
    |                           #   or
    wwwd{0,3}[.]               # "www.", "www1.", "www2." … "www999."
    |                           #   or
    [a-z0-9.-]+[.][a-z]{2,4}/  # looks like domain name followed by a slash
  )
  (?:                           # One or more:
    [^s()<>]+                      # Run of non-space, non-()<>
    |                               #   or
    (([^s()<>]+|(([^s()<>]+)))*)  # balanced parens, up to 2 levels
  )+
  (?:                           # End with:
    (([^s()<>]+|(([^s()<>]+)))*)  # balanced parens, up to 2 levels
    |                                   #   or
    [^s`!()[]{};: ".,<>?«»“”‘’]        # not a space or one of these punct chars
  )
)

http://daringfireball.net/07/improved_regex_for_matching_urls” rel=“nofollow noreferer” http://daringfireball.net/07/improved_regex_for_matching_urls

问题回答

我会在此提出一种选择,而不用说。 查阅HTML Agility Pack

var doc = new HtmlDocument();
doc.Load("file.htm");
foreach(HtmlNode link in doc.DocumentElement.SelectNodes("//a[contains(@href,  yahoo )]"])
{
  var href = link["href"];
  //href is a url that contains the word `yahoo`, do something with it
}

这实际上不是answering。 您书面提出的问题只是为了保持你的备选办法的开放性,即:。 适用于的“Ex”可产生许多其他问题。





相关问题
Anyone feel like passing it forward?

I m the only developer in my company, and am getting along well as an autodidact, but I know I m missing out on the education one gets from working with and having code reviewed by more senior devs. ...

NSArray s, Primitive types and Boxing Oh My!

I m pretty new to the Objective-C world and I have a long history with .net/C# so naturally I m inclined to use my C# wits. Now here s the question: I feel really inclined to create some type of ...

C# Marshal / Pinvoke CBitmap?

I cannot figure out how to marshal a C++ CBitmap to a C# Bitmap or Image class. My import looks like this: [DllImport(@"test.dll", CharSet = CharSet.Unicode)] public static extern IntPtr ...

How to Use Ghostscript DLL to convert PDF to PDF/A

How to user GhostScript DLL to convert PDF to PDF/A. I know I kind of have to call the exported function of gsdll32.dll whose name is gsapi_init_with_args, but how do i pass the right arguments? BTW, ...

Linqy no matchy

Maybe it s something I m doing wrong. I m just learning Linq because I m bored. And so far so good. I made a little program and it basically just outputs all matches (foreach) into a label control. ...

热门标签