The Education Forum

Wikipedia and Search-Engines


John Simkin


The Guardian reported yesterday that Jimmy Wales has “declared that every outgoing link from Wikipedia should have a ‘no follow’ tag.” It is claimed that the reason for this is that spammers have tried to exploit Wikipedia by placing links in order to increase search-engine rankings. This is clearly not true. Editors can deal with spammers. If links provide useful information, they should be allowed to remain. Links should also be used to substantiate information in the narrative as references.

The real reason that Wikipedia does not want to use these links is that it shows the way the encyclopedia steals information from other websites. When I have tried to expose this activity by placing links to my pages, they have been removed and I have been accused of being a spammer. As we have seen on the thread below, when others have attempted to do this, they have been banned from editing Wikipedia.

http://educationforum.ipbhost.com/index.php?showtopic=8861

Wikipedia have now secured a position at the top of the search-engine results for virtually any search. For example, take the case of former CIA operative “Theodore Shackley”. If you do a search at Google, Wikipedia comes first.

http://en.wikipedia.org/wiki/Theodore_Shackley

Compare the detail of the Wikipedia page with my page, which appears in 7th place.

http://www.spartacus.schoolnet.co.uk/JFKshackley.htm

Note the number of links that I give on the page, including several links to Wikipedia. Which page would a student find more useful?

More importantly, look at the page that appears in 4th place.

http://www.answers.com/topic/theodore-shackley

This is a complete copy of the Wikipedia page. The only difference is that this page contains adverts. Is this an example of Jimmy Wales making money from the many people who created the original Wikipedia article?


The Guardian reported yesterday that Jimmy Wales has “declared that every outgoing link from Wikipedia should have a ‘no follow’ tag.” It is claimed that the reason for this is that spammers have tried to exploit Wikipedia by placing links in order to increase search-engine rankings. This is clearly not true. Editors can deal with spammers. If links provide useful information, they should be allowed to remain. Links should also be used to substantiate information in the narrative as references.

To clarify: this policy change at the English Wikipedia ('no follow' used to apply only to Wikipedia's Talk pages) is not intended to affect the use of outgoing links. Such links should remain. The 'no follow' tag is only a traffic sign for search-engine spider programs to obey. Human users will notice nothing.

The point is to deter spammers who are only after better page ranking for their own sites. Wikipedia being an almost entirely volunteer operation, 'editors can deal with spammers' is true only up to a point.
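For anyone curious what that 'traffic sign' looks like in practice, here is a minimal sketch in Python. It is only an illustration: the class name LinkCollector, the helper classify_links and the sample page are all made up for this post, and real search-engine spiders are far more involved. The point it shows is that a link marked rel="nofollow" is still an ordinary link for a human reader; only a well-behaved spider sorts it differently.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Toy spider (hypothetical): sorts a page's links by whether they carry rel="nofollow"."""

    def __init__(self):
        super().__init__()
        self.followed = []    # links a spider might count towards ranking
        self.nofollowed = []  # links a polite spider would not count

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        attributes = dict(attrs)
        href = attributes.get("href")
        if href is None:
            return  # an anchor without a target is not a link
        # rel may hold several space-separated tokens, e.g. "external nofollow"
        rel_tokens = (attributes.get("rel") or "").lower().split()
        if "nofollow" in rel_tokens:
            self.nofollowed.append(href)
        else:
            self.followed.append(href)


def classify_links(html):
    """Return (followed, nofollowed) lists of href values found in the HTML text."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.followed, collector.nofollowed


# Sample page invented for illustration; example.com stands in for any outside site.
SAMPLE_PAGE = """
<p>See <a href="http://en.wikipedia.org/wiki/Answers.com">the article</a> and
<a rel="external nofollow" href="http://www.example.com/">an outside site</a>.</p>
"""

followed, nofollowed = classify_links(SAMPLE_PAGE)
print("followed:", followed)
print("nofollowed:", nofollowed)
```

Both links would still show up and work in a browser; the tag changes nothing about whether a link stays in an article.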

By the way, the Answers.com mirror site (one of many WP mirrors) is run by a NASDAQ-quoted company: see

http://en.wikipedia.org/wiki/Answers.com

Edited by Charles Matthews

By the way, the Answers.com mirror site (one of many WP mirrors) is run by a NASDAQ-quoted company: see

http://en.wikipedia.org/wiki/Answers.com

They are therefore guilty of taking the material produced by people free of charge (often stolen from other people's websites). Jimmy Wales cannot put advertising on his own website because people would understandably complain that he is making money from his volunteers. However, there is nothing to stop Answers paying Wales money for taking Wikipedia material.

What I do not understand is how Answers get such a high ranking for producing a duplicate page. I thought Google was supposed to have technology to stop this happening.

Answers is not the only website stealing Wikipedia material (although it is the most successful with its search-engine rankings). This is a list provided by Wikipedia Review:

www.reference.com

www.arthistoryclub.com

www.quickseek.com

www.omnipelagos.com

www.chemistrydaily.com

www.historymania.com

www.all-dictionaries.com/encyclopedia

www.worldwardiary.com

www.painreliefchat.com

www.gardeningdaily.com

150games.com

1911encyclopedia.org

1911ency.org

1bx.com

2020site.org

207.150.180.135

208.11.77.182

209.120.132.57

209.172.55.126

43people.com

43things.com

500ml.org

50megs.com

702.co.za

8notes.com

aaca.org

abacci.com

abcd-classics

abebooks.com

abook4all.com

aboutrufus.com

about-thermodynamics.com

absoluteastronomy.com

academickids.com

achome.co.uk

adamhershposters.com

adventuresinthearts.com

air-cooledstuff.info

airmynyorks.co.uk

ajaxian.com

alanmacek.com

alfredom.com

allaboutparkinsons.com

allartonline.com

alldeaf.com

allinsongallery.com

ambrand.com

amcoz.com.au

amica.org

amug.org

ancestralauthor.org

andregide.org

andrejkoymasky.com

andynetworkaw.com

angel-drawing.com

answerbag.com

answers.com

answerway.com

antiqbook.com

anythingarkansas.com

aol.com

apachelabs.org

apob.co.uk

appbio.net

archira.com

archivesprints.com

arikah.com

artchive.com

artilifes.com

artinthepicture.com

artist-info.com

artistopia.com

artistwd.com

artline.ro

artrenewal.org

artsaha.org

artsheaven.com

artspan.com

artspecialist.co.uk

ashmol.ox.ac.uk

askfactmaster.com

askmore.net

askmytutor.co.uk

attorney-tax.net

audiosparx.com

babelmatrix.org

baheyeldin.com

ballowax.com

barganews.com

basicfamouspeople.com

bath.ac.uk

battle-fleet.com

bbc.co.uk/dna/collective

bbc.co.uk/dna/ww2

beauty-supply.com

bellsouthpwp.net

bendigolive.com

benthamlinks.com

bestpriceart.com

bible-history.com

bible-researcher.com

biblio.com

bidorbuy.co.za

bikeaccess.net

billbam.com

biocrawler.com

biography.ms

biographyonline.net

bivouac.com

bizqte.com

blackcrayon.com

blacklooks.org

blackmask.com

blackwell-synergy.com

blantyrepast.com

blaukatze.info

blip.tv

blog

bluebird-electric.net

blujay.com

b-n.nl

bobdylantalk.com

bookcrossing.com

bookish.dk

book-of-thoth.com

bookrags.com

bookreviewof.com

booksearchisbn.com

booksearchprice.com

booksfactory.com

booksforzip.com

borgfind.com

boxing-memorabilia.com

boxxbuy.info

bradfordkingandcompany.com

brainparad.com

brainsip.com

brainyencyclopedia.com

breckinridge.com

brillig.com

britishcinemagreats.com

btinternet.co.uk

bullischarterschool.com

bumc.net

buscasitios.com

buy.com

buy-neon.com

byblos.uk.com

cafepress.com

calculator.info

calendarshoppe.com

canadaka.net

canadiancontent.net

canim.net

care2.com

careerbuzz.com

cartage.org.lb

catholicmission.org

catholic-single.info

cellist.nl

celloheaven.com

cello.org

centropa.org

cerebro.com

cgs.org

charlesstewart.ie

chiasmus.com

china1900.info

chinadaily.com.cn

chrisknight.info

christdot.org

circleofspecialfriends.com

cityofyoungstown.com

civilwarautographs.com

classicalanglican.net

classiccat.net

coalitionoftheswilling.net

cobourghistory.ca

co.greene.pa.us

coin-gallery.com

collectionscanada.ca

communicera.nu

cooleratom.info

coolquiz.com

coredump.cx

coskunfineart.com

cosmetics

cowanauctions.com

credit-insurance.us

credit.net

crimebase.co.uk

crossroad.to

crystalinks.com

cswap.com

cultivategreatness.com

cupped-expressions.net

customtermpapers.org

cyberpathway.com

cyberpedia.net

dailyguitar.com

dailykos.com

danask.com

danwymanbooks.com

datapan.com

daviddarling.info

dcgiftshop.com

democraticunderground.com

demonosia.com

design-technology.info

did-you-mean.com

digasig.com

directtextbook.com

discardedlies.com

discoverhollywood.com

dlife.com

domeisland.com

donkeylink.com

donob.com

dorchesteratheneum.org

dynup.net

earthstores.com

easyheight.com

eat-online.net

ebay.com

ebay.co.uk

ebay.se

ebiblepda.com

ebiog.com

ebookcdrom.com

ebooks-library.com

echeat.com

eclassical.com

ecoledevie.ne

economicexpert.com

economy-chat.com

economyprofessor.com

editionsilvertrust.com

edu-guide1.info

egnu.org

egyptshrine.org

ehistorybuff.com

elib.com

empirepost.com

encyclopedian.com

encyclopedia-titanica.org

englishverse.com

enjolrasworld.com

epinions.com

equityedu.com

esmartweb.com

essayempire.com

essential-architecture.com

estelle.tv

everything2.com

excite.co.uk

exodusbooks.com

experiencefestival.com

explanation-guide.info

ezboard.com

factbites.com

fact-index.com

factorielle.free.fr

factquote.com

fada.com

famousbelgians.net

famouspeople.co.uk

famouspoetsandpoems.com

famous.tc

feedburner.com

fernandobotero.biz

fictionwise.com

finance-guidance-upshot.info

findfamous.com

find.hm

findlaw.com

fine-art-sales.co.uk

firewallpedia.com

flickr.com

florence.ala.it

focusdep.com

fom.ru

foosquare.com

for68.com

forgeriesandhoaxes.com

fortunecity.com

forum

francesfarmersrevenge.com

frath.net

freebaptist.net

freeblog.hu

freedomcenter.org

freeones.com

freerepublic.com

free-scores.com

freewebs.com

freshfiction.com

ftppro.com

xxxxed.se

funtrivia.com

futura-dtp.dk

futureeducation.net

futurism.org.uk

fwrobertson.com

gallerygiselle.com

gamelow.com

gaple.com

garthclark.com

gencircles.com

genesis.ac.uk

geocaching.com

geology-books.com

geometry.net

georgeglazer.com

germannotes.com

gif-spacer-used.info

giurisprudance.com

glasglow.com

goantiques.com

goldbamboo.com

goldmark.org

googe.com.sg

google.com

gopabandhudas.com

gotohoroscope.com

governpub.com

gradinamea.ro

graybooksellers.com

great-song-stylists-uk.com

grist.org

grosell.dk

grosvenorprints.com

groups.google.co.uk

groups.msn.com

habite.com

hackpenrecords.com

hadac.sk

hair-loss-propecia.info

hannibal.net

hanovercomputer.com

harlanjberk.com

harvardmag.com

hauntedamericatours.com

haydn.dk

higherpraise.com

hipspeck.info

historycentral.com

historyhome.co.uk

hoasm.org

home-insurance.info

home-loan.net

homestayfinder.com

hometown.aol.de

hopelessly-devoted.org

hopto.org

horsesoldier.com

hschamberlain.net

hull.ac.uk

hut.ru

ibiblio.org/chineese

ideafinder.com

ifbd.net

iicoc.com

ilabdatabase.com

ilab-lila.com

ilab.org

illuminated-books.com

illuminatedbooks.com

iloveindia.com

imdb.com

iment.com

impressionistartist.info

incois.gov.in

indymedia.org

infoax.biz

infoax.com

infoflier.com

infofx.net

infography.com

informationgenius.com

infothis.com

infotut.com

infovx.net

ingentaconnect.com

insurance-lorry.net

integrativespirituality.org

invisionzone.com

ipop.co.kr

ipupdater.com

iridis.com

irresponsiblecybernetics.com

is300-lexus.info

isbn.nu

islam.com

italylink.com

itgo.com

itsrealfla.com

jahsonic.com

janda.org

janenightwork.com

jazzsports.com

jbautographs.com

jewelry.net

jewishgates.com

jiggies.com

jmucci.com

john-demartini.com

journalspace.com

jrank.org

k12.mn.us

k12.va.us

k12.wi.us

kalisz.pl

kanvasdigital.net

karadar.com

kargesfineart.com

keller-good-information.info

kent.ac.uk

keywen.com

kidsauckland.com

kilruanemacdonaghs.com

kingkong.demon.co.uk

kittybrewster.com

kli.ac.at

kneeguru.co.uk

knowmore.org

kopete.org

kosciuszkofoundation.org

kouroo.info

krehbielart.com

kstrom.net

kwintessential.co.uk

la-maison-francaise.org

last.fm

latifm.com

lavamus.com

lawyer-tax.net

left-wing.net

legendsofamerica.com

leninimports.com

lesliesacks.com

levydweck.com

libcom.org

librarium.nl

libraryjournal.com

library.org.au

librarything.com

life-insurance

lightdev.com

lindenheuvel.org

litcollection.com

literary.blogsky.com

literatureclassics.com

literaturevault.com

livejournal.com

livinglifefully.com

loeb-larocque.com

logoslibrary.eu

logosquotes.org

londonfoodfilmfiesta.co.uk

londonmuseum.on.ca

longbeachopera.org

lookingglassnews.org

lostrivers.ca

lottaliving.com

lovelandia.com

lucky7shop.com

lulu.com

luminarium.org

lumpen.com

lycos.com

lycos.co.uk

lycos.de

lyricsplayground.com

mab-x-music.com

madinpursuit.com

magicandillusion.com

magicbullet.org

magicians.nasze.net

magnamusic.com

magnoliaplantation.com

mahoneysgalleries.com.au

majicape.com

malaspina.com

malaspina.org

manstouch.com

maphist.com

mariner.org

martinfrost.ws

martinopublishing.com

mart-object.com

marxists.org

massnurses.org

masterliness.com

masterworksartgallery.com

mathematicianspictures.com

mattiajona.com

mavicanet.ru

maxwellswebmedia.com

mbceo.com

mdt.co.uk

medbib.com

medlib.com

med-help.info

medicalrace.com

meetup.com

mefeedia.com

menurestaurant.info

mesweet.net

metronjournal.it

milechai.com

mindbit.com

mindfirerenew.com

mississippi.net

mlahanas.de

mobiletopsoft.com

modernista.cz

monet-reproductions.com

monon.org

motivatedaddy.com

mouseland.org

mrfixitonline.com

mrsci.com

msim.org.uk

multieducator.com

multiply.com

multitrivia.net

mumfordbooks.co.uk

museoblaisten.com

museum-reproductions.com

musicabona.com

musicplayer.com

mutual-life.net

muzejvrsac.org.yu

muziekdriedaagse.nl

mycivilwar.com

mysite.verizon.net

myspace.com

mythichawaii.com

mytton.org

mywarof1812.com

mywebpage.netscape.com

nahc.org

nahravky.sk

nalanda.nitc.ac.in

namnewsnetwork.org

napoleonguide.com

nationalartsclub.org

nationalfilmnetwork.com

nationalgalleries.org

nationmaster.com

naxos.com

nccu.org.uk

neareasternarchaeology.com

neon-signs.info

net95.com

netbenefit.co.uk

netclusive.de

netglimse.com

netipedia.com

netlexfrance.info

netweed.com

networkinaustin.com

new-classics.co.uk

newpoland.com

newportharborhigh.com

newsfinder.org

newsscan.com

newstodaynet.com

newtechhigh.com

newyorkcityaction.com

ngic.re.kr

nhbs.com

nndb.com

nobpeace.info

nobslinks.com

nomadlife.org

norskportalen.com

northernblue.ca

nosubject.com

nuclearspin.org

nutcote.demon.co.uk

nutri.info

nyc-architecture.com

nytimes.com/books/first

nzbirds.com

ofletters.com

ohiohistory.com

ohmfree.com

oingo.com

olapreport.com

oldworldauctions.com

onesongeveryday.com

online-literature.com

online-personals

onlineseats.com

onpedia.com

open.org

open-site.org

opinionatedlesbian.com

oppapers.com

orissaindia.com

orls.org

ostrowski.cc

other-waters.com

ourcivilisation.com

ourheritage.net

overstock.com

owainphyfe.com

painfullycool.com

painreliefchat.com

paleorama.com

palettesofvision.com

palthai.com

paper-truck.net

papierdoll.net

parhasard.net

paydan-loan.info

pbagalleries.com

pedia.walla.co.il

pegasusgallery.ca

perfumecountry.com

pheeds.com

phil-books.com

philosophyprofessor.com

phschool.com

phthiraptera.org

phy.bg.ac.yu

pianoparadise.com

pianosociety.com

piglettown.com

pittsburghsymphony.org

planetarios.com

plastiquarian.com

pocketgear.com

poemhunter.com

poemofquotes.com

poetseers.org

polishwashington.com

politicalfriendster.com

politicalquest.org

polymernotes.org

portitude.org

portraitartist.com

portsunlight.org.uk

postergroup.com

powells.com

powerwissen.com

ppne.co.uk

prbm.com

prince.org

promotega.org

property2u.com

proz.com

pscelebrities.com

psigate.ac.uk

psychology.org

psychosynthesis.org

publishersrow.com

punweb.com

puritanhead.com

pyreneesguide.com

qardinalinfo.com

queertheory.com

quickseek.com

quotationsbook.com

quotesdb.info

r-3.com

railsplitter.com

readprint.com

realliteraturedir.com

reference.com

remediosvaro.biz

renaissanceconnection.org

renownedart.com

resolve3d.com

reviewpainting.com

revolutionaryplayers.org.uk

rf-champagne.com

riapress.com

ricochet-jeunes.org

ridorlive.com

ringtones

robertfrew.com

robertmusil.com

roembus.org

rollanet.org

romaniaunog.org

ronaldbrucemeyer.com

rootsweb.com

rosings.com

royal-navy.org

rusnet.nl

russianaudiobooks.com

russiandvd.com

safran-arts.com

sahistory.org.za

sah.org

saltairevillage.info

sanatan.org

sandersofoxford.com

satori.lv

sca.org

scholarsbookshelf.com

schoolnet.co.uk

sciaga.pl

scipeeps.com

scotclans.com

scotlandonline.com

scripophily.net

searchabook.us

seds.org

sermonillustrations.com

sermonindex.net

server-home.org

setcom.sk

sex-toy-party.net

shaksper.net

shetlopedia.com

shopping.com

shopping.msn.com

siegelauctions.com

sierra-arts.net

sierranevadavirtualmuseum.com

signsandshirts.co.uk

silentsaregolden.com

silkworld.nl

simplyaudiobooks.com

singaporemoms.com

sistersofmercy.org

sitesled.com

slider.com

snopes.com

snyke.com

societies.csc.tcd.ie

solarnavigator.net

songwritershalloffame.org

sorocabana.net

sosantikvarium.hu

soton.ac.uk

soulwalking.co.uk

soundandvisionmag.com

soundtrack.info

southerncrossreview.org

southsearepublic.org

southwilts.com

spacefellowship.com

sparknotes.com

speedace.info

spiderbites.about.com

spiritualsciencebiblestudies.org

spiritus-temporis.com

squaldrina.com

squarespace.com

squidoo.com

stagesb.org

stalky.com

st-and.ac.uk

standingrocktourism.com

starpulse.com

startsurfing.com

statistics4u.info

stephenkishel.com

stevengraphs.com

stjamespaddington.org.uk

stockton.sch.uk

stoneflint.com

stpaulsschool.org.uk

strategicmarketingmontreal.ca

stream-divx.com

studyplans.com

stumbleupon.com

suite101.com

sunrisediving.net

supertix.com

supertopo.com

swcast.net

swirve.com

szm.com

icarusindie.com

tagate.com

tamilnation.org

teachingamericanhistory.com

tecsoc.org

tejones.net

tektonics.org

termpaperslab.com

the-athenaeum.org

theatredatabase.com

theatrehistory.com

thebestlinks.com

thebeststuffintheworld.com

thebibleproject.com

thecatalyst.org

thecemeteryproject.com

thecomdaily.com

thecommonvirtue.com

thefiveminuteguide.com

thelatinlibrary.com

theleftcoaster.com

thelemapedia.org

themotorpool.net

themystica.com

theopedia.com

thepirateking.com

theremnanttrust.com

thescarf.org

theunjustmedia.com

theviolinsite.com

theweeweb.co.uk

thirdworldtraveler.com

thisisdiopter.org

thocp.net

thumperscorner.com

thylazine.org

ticketspot.com

tifr.res.in

timelineindex.com

tippe.com

tiptopwebsite.com

tmbl.gu.se

tnportraits.org

todayinsci.com

toeflthailand.com

tokencoins.com

tomchao.com

tomfolio.com

top100.cn

topofart.com

topxml.com

torahmitzion.org

torontoirishplayers.org

tourtempo.com

transformation.co.uk

transportstation.org

travellersinegypt.org

travelsos.net

trekearth.com

trevorphilip.com

tribalsmile.com

tribe.net

tribuneindia.com

tricolore.net

trinityofiniquity.com

triplelproductions.com

tripod.com

trocadero.com

trussel.com/hearn

tscholars.com

tsolive.org

tudorplace.com.ar

tutorgig.com

twoheadedcalf.org

typeditions.com

typeforge.net

typepad.com

typotheque.com

uepengland.com

uihealthcare.com

uik.ba

ukbookworld.com

ulsterbiography.co.uk

uni-duisburg.de

union-llc.net

universia.net

unsv.com

uplink.space.com

urbandictionary.com

usanethosting.com

usa-patriotism.com

useless-knowledge.com

users.adelphia.net

users.visi.net

usgovernetics.com

usgovi.com

usm.k12.wi.us

usroots.com

utata.org

utenti.quipo.it

utilitarianism.com

utilitarian.net

uvector.com

vahistory.org

vampirefreaks.com

vandekar.com

vandervenartbooks.com

vangoghgallery.com

vanmieghemmuseum.com

varoregistry.com

vauxhallsociety.org.uk

vedicbooks.net

verso-nlr.com

vestigatio.com

veteranlove.com

vh1.com

vias.org

vicmart.com

victorianweb.org

vigyanprasar.gov.in

villageantiques.ch

ville-ge.ch

violaheaven.com

violinmp3.com

virtualnyctour.com

virtualology.com

virtualtourist.com

visitcumbria.com

visitdunkeld.com

visitgeelong.org

visitingdc.com

visualartsadvisory.com

visual-mp3.com

visualstatistics.net

vnts.nl

vobam.se

vonl.com

vox.com

vroomjournal.com

wackipedia.com

wahooart.com

wallacecollection.org

wallace.org

waltwhitmanpoetryfestival.com

wandakelly.com

wangjianshuo.com

wapedia.org

warez.ge

waymarking.com

waza.org

webdelsol.com

webenetics.com

webhostingtalk.com

webscot.co.uk

websign.sk

website-awards.net

web-sms.info

webspawner.com

well.com

westegg.com

westvancouver.org

wetcanvas.com

wga.hu

whatis.tv

what-means.com

whereincity.com

whitneygen.org

whonamedit.com

wiki

williamsandson.com

winterspioneer.com

wkfinetools.com

wnbaboston.org

woodland-school.org

worcesterart.org

wordiq.com

wordlab.com

wordpress.com

wordsonline.org

worldcatlibraries.org

worldhistory.com

world-of-celebrities.com

worldsearch.com

worldslastchance.com

wwwhubs.com

xanga.com

xhost.ro

yahoo.com

yahoofs.jp

yamasun.com

yfu.nl

youtube.com

yukonterritorycanada.ca

zacker.com

zamir.org

zefrank.com

zeigermann.com

zeiss.de

zip.com.au

zmag.org/search

zoomclouds.com

zoominfo.com

zroorz.com

zuluwar.com

zvab.com


By the way, the Answers.com mirror site (one of many WP mirrors) is run by a NASDAQ-quoted company: see

http://en.wikipedia.org/wiki/Answers.com

They are therefore guilty of taking the material produced by people free of charge (often stolen from other people's websites). Jimmy Wales cannot put advertising on his own website because people would understandably complain that he is making money from his volunteers. However, there is nothing to stop Answers paying Wales money for taking Wikipedia material.

That would be nice of them. Under the GFDL (Wikipedia's free-content license, loosely its form of 'public domain'), material from Wikipedia can be freely reused by others, as long as the source is properly acknowledged.

Nothing 'stolen' should remain on Wikipedia. I have posted before about copyright violation.
