Uploaded image for project: 'Mahout'
  1. Mahout
  2. MAHOUT-804

Each page in Mahout's Confluence Wiki has 2 URLs, with differing page styles and search behaviours

    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Fixed
    • 0.6, 0.7
    • 0.8
    • classic

    Description

      There are two styles of URL in circulation for URLs into Mahout's Wiki (presumably an Apache-wide configuration issue):

      https://cwiki.apache.org/MAHOUT/svd-singular-value-decomposition.html vs
      https://cwiki.apache.org/confluence/display/MAHOUT/SVD+-+Singular+Value+Decomposition

      They appear to be the self-same confluence 3.4.9 installation (or its raw filetree). Each has a different search box at the top of the page. The version with 'confluence/' in the path does a confluence search, and returns similar URLs as results. The one with '.html' suffixes does a domain-constrained Google search.

      Despite markup canonicalising the confluence variant, ie. <link rel="canonical" href="https://cwiki.apache.org/confluence/display/MAHOUT/SVD+-+Singular+Value+Decomposition"> appearing in the confluence pages, it seems the Google search results typically throw people into the other version of the Wiki site.

      This is all mildly confusing, mildly annoying but overall mostly harmless. It could be having some negative impact on google rank & suchlike, since incoming links will be split between the two styles. Maybe this could be passed along to the Wiki admins?

      Which version does the Mahout team consider canonical URLs (for external links etc)?

      Attachments

        Activity

          People

            Unassigned Unassigned
            danbri Dan Brickley
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: