How do web analytics tools work? [closed]
  • Time: 2009-02-04 07:09:16

I am in the process of gathering information about web analytics tools (like Google Analytics) for my next assignment, but I have not been able to find any good information. I am looking for:

  1. Key terms used.
  2. Which methods are available for data collection, and how they work.
  3. Any reference books, white papers, etc. (both technical and non-technical).
  4. Any open source implementation (especially in .NET).
Best answer

Here are the key terms used:

  • Hit (internet)
  • Page view
  • Visit / Session
  • First Visit / First Session
  • Visitor / Unique Visitor / Unique User
  • Repeat Visitor
  • New Visitor
  • Impression
  • Singletons
  • Bounce Rate
  • % Exit
  • Visibility time
  • Session Duration
  • Page View Duration / Time on Page
  • Page Depth / Page Views per Session
  • Frequency / Session per Unique
  • Click path
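To make a couple of these terms concrete, here is a minimal sketch of how Bounce Rate and Page Views per Session could be computed once hits have been grouped into sessions. It assumes the common definitions (a bounce is a session with exactly one page view); the class and method names are made up for illustration, and real tools may define the metrics slightly differently.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class SessionMetrics
{
    // Each inner list holds the pages viewed in one session, in order.
    // Bounce rate: share of sessions that contain exactly one page view.
    public static double BounceRate(List<List<string>> sessions)
    {
        if (sessions.Count == 0) return 0.0;
        int bounces = sessions.Count(s => s.Count == 1);
        return (double)bounces / sessions.Count;
    }

    // Page depth: average number of page views per session.
    public static double PageViewsPerSession(List<List<string>> sessions)
    {
        if (sessions.Count == 0) return 0.0;
        int totalViews = sessions.Sum(s => s.Count);
        return (double)totalViews / sessions.Count;
    }
}
```

For four sessions with 1, 2, 1 and 3 page views, BounceRate gives 0.5 and PageViewsPerSession gives 1.75.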

Methods used:

  • Web server logfile analysis
  • Page tagging

Web server logfile analysis

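In this method you write a script that extracts the details from the log files and writes them to your database. This method does not give you real-time statistics. You can read more about web log analysis software here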
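As a rough sketch of the extraction step, the following parses one line in the Apache Common Log Format into a small record. The LogEntry and LogParser names are invented for this example, IIS logs use a different (W3C) layout, and writing the result to a database is left out.

```csharp
using System;
using System.Globalization;
using System.Text.RegularExpressions;

public sealed class LogEntry
{
    public string ClientIp;
    public DateTimeOffset Timestamp;
    public string Method;
    public string Path;
    public int Status;
}

public static class LogParser
{
    // host ident user [timestamp] "METHOD path protocol" status bytes
    private static readonly Regex LinePattern = new Regex(
        @"^(\S+) \S+ \S+ \[([^\]]+)\] ""(\S+) (\S+)[^""]*"" (\d{3}) \S+");

    // Returns null when the line does not look like Common Log Format.
    public static LogEntry ParseLine(string line)
    {
        Match m = LinePattern.Match(line);
        if (!m.Success) return null;

        // Log timestamps look like "10/Oct/2000:13:55:36 -0700";
        // insert a colon into the offset so it reads "-07:00".
        string raw = m.Groups[2].Value;
        if (raw.Length < 3) return null;
        raw = raw.Insert(raw.Length - 2, ":");

        DateTimeOffset when;
        if (!DateTimeOffset.TryParseExact(raw, "dd/MMM/yyyy:HH:mm:ss zzz",
                CultureInfo.InvariantCulture, DateTimeStyles.None, out when))
        {
            return null;
        }

        return new LogEntry
        {
            ClientIp = m.Groups[1].Value,
            Timestamp = when,
            Method = m.Groups[3].Value,
            Path = m.Groups[4].Value,
            Status = int.Parse(m.Groups[5].Value, CultureInfo.InvariantCulture)
        };
    }
}
```

You would call ParseLine for every line of the file and skip the ones that return null (comments, truncated lines and so on).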

Page tagging

Add JavaScript code or an image to the page, then use that code to capture details about the page, the referrer, the visitor and so on.

...these were images included in a web page that showed the number of times the image had been requested, which was an estimate of the number of visits to that page. In the late 1990s this concept evolved to include a small invisible image instead of a visible one, and, by using JavaScript, to pass along with the image request certain information about the page and the visitor. This information can then be processed remotely by a web analytics company, and extensive statistics generated...
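The image request described in the quote is just a URL with the page details packed into the query string. Below is a minimal sketch of both ends: building such a URL (what the page tag would request) and reading the values back on the collecting server. The collector address and the "page" and "ref" parameter names are assumptions made up for this example, not any vendor's actual format.

```csharp
using System;
using System.Collections.Specialized;
using System.Web;

public static class Beacon
{
    // Builds the tracking-image URL; values are escaped so that
    // characters such as '?' and '&' survive inside the query string.
    public static string BuildUrl(string collectorBase, string page, string referrer)
    {
        return collectorBase
            + "?page=" + Uri.EscapeDataString(page ?? "")
            + "&ref=" + Uri.EscapeDataString(referrer ?? "");
    }

    // Reads the (already decoded) parameters back out of a beacon URL.
    public static NameValueCollection ReadQuery(string beaconUrl)
    {
        return HttpUtility.ParseQueryString(new Uri(beaconUrl).Query);
    }
}
```

On the server you would log the values from ReadQuery together with the IP address and user agent of the request, then return the 1x1 image.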

If you are running analytics on your own website, you can use the code provided by Eytan Levit

Credit: Wikipedia. More information can be found there.

Answers

OK,

I'm not an expert, but here is some common data you can retrieve to build your own analytics:

// Runs inside an ASP.NET page or handler, where Request is available.
string str = "";
// UrlReferrer is null when the browser sends no Referer header.
str += "Referrer: " + (Request.UrlReferrer != null ? Request.UrlReferrer.AbsolutePath : "(none)") + "<br>";
str += "Form data: " + Request.Form.ToString() + "<br>";
str += "User Agent: " + Request.ServerVariables["HTTP_USER_AGENT"] + "<br>";
str += "IP Address: " + Request.UserHostAddress + "<br>";
str += "Browser: " + Request.Browser.Browser + " Version: " + Request.Browser.Version + " Platform: " + Request.Browser.Platform + "<br>";
str += "Is Crawler: " + Request.Browser.Crawler + "<br>";
str += "Query string: " + Request.QueryString.ToString() + "<br>";

You can also parse the keywords that brought the user to your site, like this:

// Requires: using System; using System.Text.RegularExpressions;
protected string GetKeywordFromReferrer(string url)
{
    if (url == null || url.Trim() == "")
    {
        return "no url";
    }
    // Decode percent-escapes, then treat '+' as an encoded space.
    string urlEscaped = Uri.UnescapeDataString(url).Replace('+', ' ');

    // Search engines commonly pass the search terms as q= or p=.
    Match searchQuery = Regex.Match(urlEscaped, @"[&?][qp]=([^&]*)");
    if (searchQuery.Success && searchQuery.Groups[1].Value != "")
    {
        return searchQuery.Groups[1].Value;
    }

    // No search terms: fall back to the referring host name.
    Match siteDomain = Regex.Match(urlEscaped, @"https?://(.+?)/");
    if (siteDomain.Success)
    {
        return siteDomain.Groups[1].Value;
    }

    return "Direct Access";
}

Hope this helps.

1. Key terms used
Same as in the first answer.

2. Which methods are available for data collection, and how they work
Log files from Apache or IIS. HTTP handlers for ASP.NET, or your actual page. JavaScript includes (the objects available to JavaScript give you most of the information you need about the client).

3. Any reference books, white papers etc (technical and non technical both)
The HTTP RFC is useful; it lists most of the request headers you can capture.

4. Any open source implementation (especially in .NET)

I wrote the parsing part of an analytics tool (in my opinion the hardest part). It needs some fine-tuning in places, as it is four years old.

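It is missing a DAL, which is harder than it sounds: the main hurdle is making sure you don't replicate the exact data that each log line has, because then you might as well just use the log files. The other part is displaying this aggregated data in a nice format. My goal was to store it in SQL Server, and also in db4o format to cater for smaller websites.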
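To illustrate the point about not storing one row per log line, here is a minimal sketch that rolls hits up into one counter per day and path before anything is written to storage. This is not the Statmagic code; the HitAggregator name and the choice of (day, path) as the key are assumptions for the example.

```csharp
using System;
using System.Collections.Generic;

public static class HitAggregator
{
    // Collapses individual hits into one count per (day, path),
    // so storage holds aggregates instead of a copy of the log.
    public static Dictionary<(DateTime Day, string Path), int> CountByDayAndPath(
        IEnumerable<(DateTime Timestamp, string Path)> hits)
    {
        var counts = new Dictionary<(DateTime Day, string Path), int>();
        foreach (var hit in hits)
        {
            var key = (hit.Timestamp.Date, hit.Path);
            // TryGetValue leaves current at 0 for a key not seen before.
            counts.TryGetValue(key, out int current);
            counts[key] = current + 1;
        }
        return counts;
    }
}
```

Each dictionary entry then maps to a single row (day, path, hits) in SQL Server or whatever store you pick, however many log lines went into it.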

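The sad part of the Statmagic project is that Google came into the market and completely wiped out the competition, so I no longer had any incentive to finish it.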
