An anonymous reader writes Only days after receiving harsh criticism from all corners of the internet for taking down links to news articles, Google has started to reinstate those links. Google's Peter Barron denied that they were simply granting all "right to be forgotten" requests. "The European Court of Justice [ECJ] ruling was not something that we welcomed, that we wanted — but it is now the law in Europe and we are obliged to comply with that law," he said. Still, Google's actions are being called "tactical" for how quickly they were able to stir public dissent over the EU ruling. "It's convenient, then, that it's found a way to get the media to kick up the fuss for it: there are very few news organisations in the world who are happy to hear their output is being stifled. A few automated messages later, the story is back in the headlines – and Google is likely to be happy about that."
An anonymous reader writes in with this article from the BBC about Google's recent removal of a news story from search results. "Google's decision to remove a BBC article from some of its search results was "not a good judgement", a European Commission spokesman has said. A link to an article by Robert Peston was taken down under the European court's "right to be forgotten" ruling. But Ryan Heath, spokesman for the European Commission's vice-president, said he could not see a "reasonable public interest" for the action. He said the ruling should not allow people to "Photoshop their lives". The BBC understands that Google is sifting through more than 250,000 web links people wanted removed."
Albanach writes: In 2007, the BBC's economics editor, Robert Peston, penned an article on the massive losses at Merrill Lynch and the resulting resignation of their CEO Stan O'Neal. Today, the BBC has been notified that the 2007 article will no longer appear in some Google searches made within the European Union, apparently as a result of someone exercising their new-found "right to be forgotten." O'Neal was the only individual named in the 2007 article. While O'Neal has left Merrill Lynch, he has not left the world of business, and now holds a directorship at Alcoa, the world's third largest aluminum producer with $23 billion in revenues in 2013.
An anonymous reader writes In the aftermath of the European Court of Justice "right to be forgotten" decision, many asked whether a similar ruling could arise elsewhere. While a privacy-related ruling has yet to hit Canada, Michael Geist reports that last week a Canadian court relied in part on the decision in issuing an unprecedented order requiring Google to remove websites from its global index. The ruling is unusual since its reach extends far beyond Canada. Rather than ordering the company to remove certain links from the search results available through Google.ca, the order intentionally targets the entire database, requiring the company to ensure that no one, anywhere in the world, can see the search results.
angry tapir (1463043) writes Microsoft will soon offer a service aimed at making machine-learning technology more widely usable. "We want to bring machine learning to many more people," Eron Kelly, Microsoft corporate vice president and director SQL Server marketing, said of Microsoft Azure Machine Learning, due to be launched in beta form in July. "The line of business owners and the marketing teams really want to use data to get ahead, but data volumes are getting so large that it is difficult for businesses to sift through it all," Kelly said. The service will have "...an interface called the Machine Learning Studio. The palette includes visual icons for some of the most commonly used machine-learning algorithms, allowing the user to drag and drop them into a visually depicted workflow." Algorithms themselves are implemented in R, which the user of the service can use directly as well.
NewYorkCountryLawyer (912032) writes In Authors Guild v Hathitrust, the US Court of Appeals for the Second Circuit has found that scanning whole books and making them searchable for research use is a fair use. In reaching its conclusion, the 3-judge panel reasoned, in its 34-page opinion (PDF), that the creation of a searchable, full text database is a "quintessentially transformative use", that it was "reasonably necessary" to make use of the entire works, that maintaining four copies of the database was reasonably necessary as well, and that the research library did not impair the market for the originals. Needless to say, this ruling augurs well for Google in Authors Guild v. Google, which likewise involves full text scanning of whole books for research.
Daniel_Stuckey (2647775) writes "Following broad security scares like that caused by the Heartbleed bug, it can be frustratingly difficult to find out if a site you use often still has gaping flaws. But a little known community of software developers is trying to change that, by creating a searchable, public index of websites with known security issues. Think of Project Un1c0rn as a Google for site security. Launched on May 15th, the site's creators say that so far it has indexed 59,000 websites and counting. The goal, according to its founders, is to document open leaks caused by the Heartbleed bug, as well as 'access to users' databases' in MongoDB and MySQL. According to the developers, those three types of vulnerabilities are most widespread because they rely on commonly used tools. For example, Mongo databases are used by popular sites like LinkedIn, Expedia, and SourceForge, while MySQL powers applications such as WordPress, Drupal or Joomla, and is even used by Twitter, Google and Facebook."
redletterdave (2493036) writes 'Apple has purchased Spotsetter, a social search engine that uses big data to offer personalized recommendations for places to go. Spotsetter was designed to combine recommendations from friends with trusted reviews and other data to create more social maps. It would show you which friends were 'experts' in a given area, and you could tag your friends as experts (like LinkedIn) to boost the influence of their recommendations. You could also discover new places by browsing Spotsetter's maps to see where your friends have been and what they've recommended. Spotsetter's app, which was available on iOS and Android, officially closed down just six days ago.'
The EU's new rule (the result of a court case published May 13) requiring that online businesses remove on request information that is "inadequate, irrelevant or no longer relevant" has struck a chord with more than 12,000 individuals, a number that's rising fast. Other search engines, ISPs, and firms are sure to follow, but the most prominent reaction to the decision thus far, and one that will probably influence all the ones to come, is Google's implementation of an online form that users can submit to request that information related to them be deleted. The Daily Mail reports that the EU ruling "has already been criticised after early indications that around 12 per cent of applications were related to paedophilia. A further 30 per cent concern fraud and 20 per cent were about people's arrests or convictions"; we mentioned earlier this month one pedophile's request for anonymity. As the First Post story linked above puts it, the requirement that sites scrub their data on request puts Internet companies in the position of having to interpret the court’s broad criteria for information meeting the mandate's definition of "forgettable," "as well as developing criteria for distinguishing public figures from private individuals." Do you favor opt-out permissions for reporting facts linked to individuals? What data or opinions about themselves should people not be able to suppress? (Note: Google's form has this disclaimer: "We're working to finalize our implementation of removal requests under European data protection law as soon as possible. In the meantime, please fill out the form below and we will notify you when we start processing your request." That finalization may take some time, since there are 28 data-protection agencies across the EU to harmonize.)
An anonymous reader writes "We've known for a while now that Google is testing a new favoriting service called Google Stars, aimed at helping users save, share, and organize Web content. This is largely due to multiple leaks, detailing features as well as showing off the interface in a video and screenshots. Today, Google+ user Florian Kiersch, who has done the majority of the digging behind the service, has leaked the Google Stars extension for Google Chrome."
First time accepted submitter S37Rigor Mortis (1601271) writes "Torrentz.eu, the largest torrent search engine on the Internet, has had its domain name suspended following a request from the Police Intellectual Property Crime Unit in the UK. The site continues to operate under two alternative domains, and is hoping to move the .eu domain to a new registrar." Update: 05/27 12:53 GMT by T : That was quick; the site is back, "after the owners pointed out that its suspension was illegal."
New submitter perplexing.reader (2241844) writes "Microsoft is paying Brazilian users US$2 in Skype vouchers to set Bing as their default search engine and MSN as their default home page. Translated from the site: 'Make MSN your homepage and Bing your default search engine and earn up to 60 minutes of calls to mobiles and landlines in Skype.' ... The Rules: 'After receiving the voucher, this should be used until July 31, 2014. Once on Skype, the credits do not expire. The minutes are based on a rate of $ 0.023 per minute, but the number of minutes may vary depending on the destination of the call and the number of calls you make. The current value of the voucher is $2.00. [Once claimed], the voucher will appear in your Skype account.'" (For those outside Brazil, the page brings up a message that translates to "Sorry, this promotion is not available for your country.")
Paul Fernhout (109597) writes "MetaFilter recently announced layoffs due to a decline in ad revenue that started with a mysterious 40% drop in traffic from Google on November 17, 2012, and which never recovered. Danny Sullivan at SearchEngineLand explores in detail how MetaFilter 'serves as a poster child of problems with Google's penalty process, despite all the advances Google has made over the years.' Caitlin Dewey at the Washington Post puts it more bluntly: 'That may be the most striking, prescient takeaway from the whole MetaFilter episode: the extent to which the modern Web does not incentivize quality.'"
An anonymous reader writes "Microsoft today confirmed the rumors of a new edition of its latest operating system by unveiling Windows 8.1 with Bing. The company says the main purpose of the new SKU is to allow its hardware partners to sell lower-cost Windows devices; the first ones with the new edition will be announced next month at Computex in Taipei. Windows 8.1 with Bing is exactly like Windows 8.1 with the recently released Windows 8.1 Update, with one major difference: Bing is set as the default search engine in Internet Explorer. Users can still change that option in IE's search engine settings, but OEMs do not have that luxury."
NapalmV sends this news from the BBC: "The European Union Court of Justice said links to 'irrelevant' and outdated data should be erased on request. The case was brought by a Spanish man who complained that an auction notice of his repossessed home on Google's search results infringed his privacy. Google said the ruling was 'disappointing.'" The EU Justice Commissioner said, "Companies can no longer hide behind their servers being based in California or anywhere else in the world. ... The data belongs to the individual, not to the company. And unless there is a good reason to retain this data, an individual should be empowered — by law — to request erasure of this data." According to the ruling (PDF), if a search provider declines to remove the data, the user can escalate the situation to a judicial authority to make sure the user's rights are being respected.
Mark.JUK (1222360) writes "The European Court of Justice (ECJ) has today ruled that Google, Bing and others, acting as internet search engine operators, are responsible for the processing that they carry out of personal data which appears on web pages published by third parties. As a result, links and descriptions returned by a search on a person's name, for web pages containing information on that person, can be removed upon request by the individual concerned. The decision supports calls for a so-called 'right to be forgotten' by Internet privacy advocates, which ironically the European Commission is already working to implement via new legislation. Google argued unsuccessfully that such a decision would be unfair because the information was already legally in the public domain."
KindMind (897865) writes "From the Washington Post: 'Psychologist Robert Epstein has been researching [how much influence search engines have on voting behavior] and says he is alarmed at what he has discovered. His most recent experiment, whose findings were released Monday, found that search engines have the potential to profoundly influence voters without them noticing the impact ... Epstein, former editor-in-chief of Psychology Today and a vocal critic of Google, has not produced evidence that this or any other search engine has intentionally deployed this power. But the new experiment builds on his earlier work by measuring SEME (Search Engine Manipulation Effect) in the concrete setting of India's national election, whose voting concludes Monday.'"
Daniel_Stuckey (2647775) writes "How risky is it to use the words "bomb," "plague," or "gun" online? That was a question we posed, tongue in cheek, with a web toy we built last year called Hello NSA. It offers users suggested tweets that use words drawn from a list of watchwords that analysts at the Dept. of Homeland Security are instructed to search for on social media. "Stop holding my love hostage," one of the tweets read. "My emotions are like a tornado of fundamentalist wildfire." It was silly, but it was also imagined as an absurdist response to the absurdist ways that dragnet surveillance of the public and non-public Internet jars with our ideas of freedom of speech and privacy. And yet, after reading the mounting pile of NSA PowerPoints, are all of us as comfortable as we used to be Googling for a word like "anthrax," even if we were simply looking up our favorite thrash metal band? Maybe not. According to a new study of Google search trends, searches for terms deemed to be sensitive to government or privacy concerns have dropped "significantly" in the months since Edward Snowden's revelations in July."
itwbennett writes: "A class-action lawsuit filed Thursday (PDF) accuses Google of strong-arming device manufacturers into making its search engine the default on Android devices, driving up the cost of those devices and hurting consumers. The suit does not argue that device manufacturers entered Mobile Application Distribution Agreements involuntarily, but that the market power of Google compels them to. 'Because consumers want access to Google's products, and due to Google's power in the U.S. market for general handheld search, Google has unrivaled market power over smartphone and tablet manufacturers,' says the suit."
itwbennett (1594911) writes "Google will no longer scan the email messages of students and other school staff who use its Google Apps for Education suite, exempting about 30 million users from the chronically controversial practice for Gmail advertising. In addition, Google is removing the option for Apps for Education administrators to allow ads to be shown to their users. Until now, ads were turned off by default, but admins could turn on this feature at their discretion. A Google spokesperson called the move part of a 'continued evolution of our efforts to provide the best experience for our users, including students' and not a response to a recent lawsuit alleging that by scanning Gmail messages Google violated wiretapping laws and breached users' privacy."
Daniel_Stuckey (2647775) writes "If you were anywhere near the internet last week, you would have come across reports of 'DarkMarket', a new system being touted as a Silk Road the FBI could never seize. Although running in a similar fashion on the face of things — some users buy drugs, others sell them — DarkMarket works in a fundamentally different way to Silk Road or any other online marketplace. Instead of being hosted off a server like a normal website, it runs in a decentralized manner: Users download a piece of software onto their device, which allows them to access the DarkMarket site. The really clever part is how the system incorporates data with the blockchain, the part of Bitcoin that everybody can see. Rather than just carrying the currency from buyer to seller, data such as user names are added to the blockchain by including them in very small transactions, meaning that it's impossible to impersonate someone else because their pseudonymous identity is preserved in the ledger. Andy Greenberg has a good explanation of how it works over at Wired. The prototype includes nearly everything needed for a working marketplace: private communications between buyers and sellers, Bitcoin transfers to make purchases, and an escrow system that protects the cash until it is confirmed that the buyer has received their product. Theoretically, being a decentralized and thus autonomous network, it would still run without any assistance from site administrators, and would certainly make seizing a central server, as was the case with the original Silk Road, impossible."
An anonymous reader writes "The Citizens Commission on Human Rights (CCHR), a Scientology front group, has received a 'grant from Google in the amount of $10,000 per month worth of Pay Per Click Advertising to be used in our Orange County anti-psych campaigns.' CCHR believes that ALL psychiatrists are evil. They believe that psychiatrists were behind the Holocaust, and these shadow men were never brought to justice. CCHR also believes that psychiatrists were behind the 9/11 attacks. Scientologists believe that psychiatrists have always been evil, and their treachery goes back 75 million years when the psychiatrists assisted XENU in killing countless alien life forms. Thanks Google! We may be able to stop these evil Psychs once and for all!"
First time accepted submitter turkeydance (1266624) writes "The dark web just got a little less dark with the launch of a new search engine that lets you easily find illicit drugs and other contraband online. Grams, which launched last week and is patterned after Google, is accessible only through the Tor anonymizing browser (the address for Grams is: grams7enufi7jmdl.onion) but fills a niche for anyone seeking quick access to sites selling drugs, guns, stolen credit card numbers, counterfeit cash and fake IDs — sites that previously only could be found by users who knew the exact URL for the site."
judgecorp (778838) writes "Three weeks after Russia asserted that Crimea is part of its territory, the social networks have a problem: how to categorize their users from the region? Facebook and the largest Russian social network, Vkontakte, still say Crimeans are located in Ukraine, while other Russian social networks say they are Russians. Meanwhile, on Wikipedia, an edit war has resulted in Crimea being part of Russia, but shaded a different colour to signify the territory is disputed. Search engine Yandex is trying to cover both angles: its maps service gives a different answer, depending on which location you send your query from."
An anonymous reader writes "Members of the Senate have proposed a bill that would prohibit the National Technical Information Service (NTIS) from selling to other U.S. federal agencies technical papers that are already freely available. NTIS is under the Department of Commerce. The bill is probably a result of a 2012 report by the Government Accountability Office (GAO) which points out that 'Of the reports added to NTIS's repository during fiscal years 1990 through 2011, GAO estimates that approximately 74 percent were readily available from other public sources.' Ars Technica notes that the term 'public sources' refers to 'either the issuing organization's website, the federal Internet portal, or another online resource.'"
redletterdave (2493036) writes "Facebook owns virtually all the aspects of the social experience—photos (Instagram), status updates (Facebook), location services (Places)—but now, Facebook is transitioning from a simple social network to a full-fledged technology company that rivals Google, moonshot for moonshot. Yet, it's Facebook's corporate control of traffic that leads many to distrust the company. In a sense, people are stuck. When the time comes for someone to abandon Facebook, whether over privacy concerns or frustration with the company, Facebook intentionally makes it hard to leave. Even if you delete your account, your ghost remains—even when you die, Facebook can still make money off you. And that's not behavior fit for a company that's poised to take over the future."
theodp (442580) writes "Over the years, Mozilla's reliance on Google has continued to grow. Indeed, in its report on Brendan Eich's promotion to CEO of Mozilla, the WSJ noted that "Google accounted for nearly 90% of Mozilla's $311 million in revenue." So, with its Sugar Daddy having also gone on record as being virulently opposed to Proposition 8, to think that Google's support didn't enter into discussions of whether Prop 8 backer Eich should stay or go seems, well, pretty much unthinkable. "It is the chilling and discriminatory effect of the proposition on many of our employees that brings Google to publicly oppose Proposition 8," explained Google co-founder Sergey Brin in 2008. "We should not eliminate anyone's fundamental rights, whatever their sexuality, to marry the person they love." Interestingly, breaking the news of Eich's resignation was journalist Kara Swisher, whose right to marry a top Google exec in 2008 was nearly eliminated by Prop 8. "In an interview this morning," wrote Swisher, "Mozilla Executive Chairwoman Mitchell Baker said that Eich's ability to lead the company that makes the Firefox Web browser had been badly damaged by the continued scrutiny over the hot-button issue, which had actually been known since 2012 inside the Mozilla community." Swisher, whose article was cited by the NY Times in The Campaign Against Mozilla's Brendan Eich, added that "it was not hard to get the sense that Eich really wanted to stick strongly by his views about gay marriage, which run counter to much of the tech industry and, increasingly, the general population in the U.S. For example, he repeatedly declined to answer when asked if he would donate to a similar initiative today." So, was keeping Eich aboard viewed by Mozilla — perhaps even by Eich himself — as a possible threat to the reported $1 billion minimum revenue guarantee the organization enjoys for delivering search queries for Google?"
redletterdave writes: "At the Fire TV unveiling, Amazon officials sounded like they perfectly understood how frustrating TV streaming devices are for their owners. Amazon focused on three main problems: Search is hard, especially for anything not on a bestseller list; streaming devices often provide slow or laggy performance; and TV set-top boxes tend to be closed ecosystems. The Fire TV is Amazon's attempt to solve these three problems—the key word here being 'attempt.' Perhaps Amazon's homegrown solution was a bit premature and its ambitions too lofty, because while Fire TV can do almost everything, little of it is done right." An example given by the review is how the touted Voice Search works — it doesn't interact at all with supported apps, instead bringing up Amazon search results. Thus, even if you have access to a movie for free through Netflix, using the Voice Search for that movie will only bring up Amazon's paid options.
harrymcc (1641347) writes "Google officially — and mischievously — unveiled Gmail on April Fools' Day 2004. That makes this its tenth birthday, which I celebrated by talking to a bunch of the people who created the service for TIME.com. It's an amazing story: The service was in the works for almost three years before the announcement, and faced so much opposition from within Google that it wasn't clear it would ever reach consumers." Update: 04/01 13:37 GMT by T : We've introduced a lot of new features lately; some readers may note that with this story we are slowly rolling out one we hope you enjoy -- an audio version of each Slashdot story. If you are one of the readers in our testing pool, you'll hear the story just by clicking on it from the home page as if to read the comments; if you're driving, we hope you'll use your mobile devices responsibly.
jfruh (300774) writes "Facebook and Google only make money from people when they can access the Internet, so it stands to reason that they're working to make sure everyone has access. While Google's Project Loon seeks to use balloon-borne equipment, Mark Zuckerberg envisions a system of drones and laser beams."
jfruh writes: "You will probably not be surprised to learn that Chinese search giant Baidu censors a wide range of content, particularly political material deemed to be pro-democracy — and does so for users everywhere, not just in China. A group of activists filed suit against Baidu in New York for violating free speech laws, but the judge in the case declared (PDF) that, as a private entity in the United States, Baidu has the right to provide whatever kind of search results it wants, even for political reasons."
An anonymous reader writes "Google today announced Google Now is coming to the Chrome stable channel for Windows and Mac 'starting today and rolling out over the next few weeks.' This means Google Now notifications will finally be available to desktop and laptop Chrome users, in addition to Android and iOS users. To turn the feature on, all you need to do is sign in to Chrome with the same Google Account you're using for Google Now on mobile. If you use Google Now on multiple devices, you will need to manage your location settings for each device independently (change Location Reporting on Android and iOS)."
jfruh writes "For years, paid links returned from Google search queries have been set off from 'real' search results by their placement on the page and by a colored background. But some users have begun to see a different format for these ads: a tiny yellow button that reads 'AD' at the end of the link is the only distinguishing feature. Google is notoriously close-mouthed about this sort of thing, but it may begin rolling the new format out to more users soon."
wabrandsma writes with this story from NewScientist: "Google may be a master at data wrangling, but one of its products has been making bogus data-driven predictions. A study of Google's much-hyped flu tracker has found that it has consistently overestimated flu cases in the US for years. It's a failure that highlights the danger of relying on big data technologies.
Evan Selinger, a technology ethicist at Rochester Institute of Technology in New York, says Google Flu's failures hint at a larger problem with the algorithmic approach taken by technology companies to deliver services we all want to use. The problem is with the assumption that either the data that is gathered about us, or the algorithms used to process it, are neutral." Google Flu Trends has been discussed on Slashdot before: When Google Got Flu Wrong.
An anonymous reader writes "Google is facing investigation by the Competition Commission of India and potentially faces fines up to 10% of its three-year average turnover. While Google has settled anti-trust cases in the U.S. and the European Union, India's competition regime does not have provisions for a settlement process." From the Times of India article linked: "The complaint against Google, also one of the world's most valued companies, was first filed by advocacy group CUTS International way back in late 2011. Later, matrimonial website matrimony.com also filed a complaint. Referring to Google's settlement with the European Commission, matrimony.com counsel Ferida Satarawala said: 'Google's unfair use of trademarks as well as its retaliatory conduct are not specifically addressed in the European settlement and are distinct theories of harm being pursued by the CCI. Therefore, this settlement is unlikely to address CCI's concerns in our case.'"
An anonymous reader writes "Google's Eric Schmidt and Jared Cohen were [part of a] wide-ranging session at SXSW today and they revealed that Google's data is now safely protected from the prying eyes of government organizations. In the last few days Google upgraded its security measures following revelations that Britain's GCHQ had intercepted data being transmitted between Google datacenters. Schmidt said that his company's upgrades following the incident left him 'pretty sure that information within Google is now safe from any government's prying eyes.'"
An anonymous reader writes "Microsoft is exploring whether to release a free version of Windows to increase the number of computers using the latest operating system. Currently the company seems to be testing a new version of the OS called 'Windows 8.1 with Bing', which will include Microsoft's key modern apps and services."
cold fjord writes with news that the Supreme Court has expanded the ability of police officers to search a home without needing a warrant, quoting the LA Times: "Police officers may enter and search a home without a warrant as long as one occupant consents, even if another resident has previously objected, the Supreme Court ruled Tuesday ... The 6-3 ruling ... gives authorities more leeway to search homes without obtaining a warrant, even when there is no emergency. The majority ... said police need not take the time to get a magistrate's approval before entering a home in such cases. But dissenters ... warned that the decision would erode protections against warrantless home searches." In this case, one occupant objected to the search and was arrested; the police then returned and obtained the consent of the remaining occupant.
itwbennett writes "Google has done what the European Commission declined to do: publish the details of the latest commitments Google made in a bid to settle a long-running antitrust case involving its treatment of rival specialist search services, among other matters. On the company's European policy blog, Google's senior vice president and general counsel, Kent Walker, announced the publication of what he called the 'full text' (PDF) of the company's commitments. In fact, the 93-page document contains a number of redactions, including details of a parameter used to rank search results, the identities of two companies with customized contracts for Adsense For Search, and a proposal for modification of those contracts to comply with the other commitments."
Nerval's Lobster writes "Microsoft has censored Chinese-language results for Bing users in the United States as well as mainland China, according to an article in The Guardian. But this isn't the first time that Bing's run into significant controversy over the 'sanitizing' of Chinese-language search results outside of mainland China. In November 2009, Microsoft came under fire from free-speech advocates after New York Times columnist Nicholas Kristof accused the company of 'craven kowtowing' to the mainland Chinese government by sanitizing its Chinese-language search results for users around the world. Just as with The Guardian and other news outlets this week, Microsoft insisted at the time that a 'bug' was to blame for the sanitized search results. 'The bug identified in the web image search was indeed fixed,' a Microsoft spokesperson told me in December 2009, after I presented them with a series of screenshots suggesting that the pro-Chinese-government filter remained in effect even after Kristof's column. 'Please also note that Microsoft 'recognize[s] that we can continue to improve our relevancy and comprehensiveness in these web results and we will.' Time will tell whether anything's different this time around."
First time accepted submitter vigna writes "The Laboratory for Web Algorithmics of the Università degli studi di Milano together with the Data and Web Science Group of the University of Mannheim have put together the first entirely open ranking of more than 100 million sites of the Web. The ranking is based on classic and easily explainable centrality measures applied to a host graph, and it is entirely open: all data and all software used is publicly available. Just in case you wonder, the number one site is YouTube, the second Wikipedia, and the third Twitter." They are using the Common Crawl data (first released in November 2011). Sites are ranked using harmonic centrality, with raw indegree centrality, Katz's index, and PageRank provided for comparison. More information about the web graph is available in a pre-print paper that will be presented at the World Wide Web Conference in April.
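For readers who have not met harmonic centrality before: a host's score is the sum of 1/d over every other host that can reach it, where d is the length of the shortest link path into it. The snippet below is only a minimal sketch of that definition on a made-up four-host graph. The hostnames and the helper names `harmonic_centrality` and `indegree` are hypothetical, and this is not the software the two labs published.

```python
from collections import deque


def _incoming_edges(graph):
    """Build host -> list of distinct hosts linking to it (self-links ignored)."""
    nodes = set(graph)
    for targets in graph.values():
        nodes.update(targets)
    incoming = {node: [] for node in nodes}
    for source, targets in graph.items():
        for target in set(targets):
            if target != source:
                incoming[target].append(source)
    return incoming


def indegree(graph):
    """Raw indegree: number of distinct hosts linking to each host."""
    return {node: len(preds) for node, preds in _incoming_edges(graph).items()}


def harmonic_centrality(graph):
    """Sum of 1/d(y, x) over all hosts y that can reach x, for each host x."""
    incoming = _incoming_edges(graph)
    scores = {}
    for target in incoming:
        # Breadth-first search backwards along links, starting at the target.
        distance = {target: 0}
        queue = deque([target])
        total = 0.0
        while queue:
            node = queue.popleft()
            for pred in incoming[node]:
                if pred not in distance:
                    distance[pred] = distance[node] + 1
                    total += 1.0 / distance[pred]
                    queue.append(pred)
        scores[target] = total
    return scores


# Hypothetical toy host graph: each host maps to the hosts it links to.
toy_graph = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example"],
    "c.example": ["a.example"],
    "d.example": ["c.example"],
}

scores = harmonic_centrality(toy_graph)
degrees = indegree(toy_graph)
ranking = sorted(scores, key=scores.get, reverse=True)
```

On this toy graph, a.example and b.example tie on raw indegree (one inbound link each), but harmonic centrality ranks a.example higher because more hosts can reach it over short paths. Running one backward search per host is fine for a sketch but would presumably be far too slow on a graph with 100 million hosts.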
kc123 sends this report from The Guardian: "Microsoft's search engine Bing appears to be censoring information for Chinese language users in the U.S. in the same way it filters results in mainland China. Searches first conducted by anti-censorship campaigners at FreeWeibo, a tool that allows uncensored search of Chinese blogs, found that Bing returns radically different results in the U.S. for English and Chinese language searches on a series of controversial terms. These include Dalai Lama, June 4 incident (how the Chinese refer to the Tiananmen Square protests of 1989), Falun Gong and FreeGate, a popular internet workaround for government censorship."
mpicpp points out an article detailing the case of French blogger Olivier Laurelli, who had the misfortune to click links from search results. Laurelli stumbled upon a public link leading to documents from the French National Agency for Food Safety, Environment, and Labor. He downloaded them (over 7 GB worth) and looked through them, eventually publishing a few slides to his website. When one of France's intelligence agencies found out, they took Laurelli into custody and indicted him, referring to him as a 'hacker.' In their own investigation, they said, "we then found that it was sufficient to have the full URL to access to the resource on the extranet in order to bypass the authentication rules on this server." The first court acquitted Laurelli of the charges against him. An appeals court affirmed part of the decision, but convicted him of "theft of documents and fraudulent retention of information." He was fined €3,000 (about $4,000).
coondoggie writes "The scientists at DARPA say the current methods of searching the Internet for all manner of information just won't cut it in the future. Today the agency announced a program that would aim to totally revamp Internet search and 'revolutionize the discovery, organization and presentation of search results.' Specifically, the goal of DARPA's Memex program is to develop software that will enable domain-specific indexing of public web content and domain-specific search capabilities. According to the agency, the technologies developed in the program will also provide the mechanisms for content discovery, information extraction, information retrieval, user collaboration, and other areas needed to address distributed aggregation, analysis, and presentation of web content."
ananyo writes "Publishing giant Elsevier says that it has now made it easy for scientists to extract facts and data computationally from its more than 11 million online research papers. Other publishers are likely to follow suit this year, lowering barriers to the computer-based research technique. But some scientists object that even as publishers roll out improved technical infrastructure and allow greater access, they are exerting tight legal controls over the way text-mining is done. Under the arrangements, announced on 26 January at the American Library Association conference in Philadelphia, Pennsylvania, researchers at academic institutions can use Elsevier's online interface (API) to batch-download documents in computer-readable XML format. Elsevier has chosen to provisionally limit researchers to 10,000 articles per week. These can be freely mined, so long as the researchers, or their institutions, sign a legal agreement. The deal includes conditions: for instance, that researchers may publish the products of their text-mining work only under a license that restricts use to non-commercial purposes, can include only snippets (of up to 200 characters) of the original text, and must include links to original content."
jfruh writes "Baidu, a company that offers a search engine, a Wikipedia-style user-edited encyclopedia, and other online services, is a household name in China. Now the company is seeking to gain ground on Google in the rest of the world, opening local search sites (in local languages) for Thailand, Brazil, and Egypt."
An anonymous reader writes "Canadian law professor Michael Geist reports that the Canadian arm of the RIAA is calling for new Internet regulation, including website blocking and search result manipulation. While the Canadian music industry experienced increased digital sales last year (sales declined in the U.S.) and the Ontario government is handing out tens of millions of tax dollars to the industry, the industry now wants the government to step in with website blocking and ordering search companies to change their results to focus on iTunes and other sales sites."
waderoush writes "An Xconomy column [Friday] suggests that Google is getting too big. When the company was younger, most of its acquisitions related to its core businesses of search, advertising, network infrastructure, and communications. More recently, it's been colonizing areas with a less obvious connection to search, such as travel, social networking, productivity, logistics, energy, robotics, and — with the acquisition this week of Nest Labs — home sensor networks and automation. A Google acquisition can obviously mean a big payoff for startup founders and their investors, but as the company grows by accretion it may actually be slowing innovation in Silicon Valley (since teams inside the Googleplex, with its endless fountain of AdWords revenue, can stop worrying about making money or meeting market needs). And by infiltrating so many corners of consumers' lives — and collecting personal and behavioral data as it goes — it's becoming an all-encompassing presence, and making itself ever more attractive as a target for marketers, data thieves, and government snoops. 'Any sufficiently advanced search, communications, and sensing infrastructure is indistinguishable from Big Brother,' the column argues."
theodp writes "Writing in the NY Times, Dr. Haider Javed Warraich shares a dirty little medical secret: doctors do 'Google' their patients, and the practice is likely to only become more common. And while he personally feels the practice should be restricted to situations where there's a genuine safety issue, an anecdote Warraich shares illustrates how patient search could provide insight into what otherwise might be unsolved mysteries — or lead to a snap misdiagnosis: 'I was once taking care of a frail, older patient who came to the hospital feeling very short of breath. It wasn't immediately clear why, but her breathing was getting worse. To look for accidental ingestions, I sent for a drug screen and, to my great surprise, it came back positive for cocaine. It didn't make sense to me, given her age and the person lying before me, and I was concerned she had been the victim of some sort of abuse. She told me she had no idea why there was cocaine in her system. When I walked out of the room, a nurse called me over to her computer. There, on MugShots.com, was a younger version of my patient's face, with details about how she had been detained for cocaine possession more than three decades earlier. I looked away from the screen, feeling like I had violated my patient's privacy. I resumed our medical exam, without bringing up the finding on the Internet, and her subsequent hospital course was uneventful.'"