EFY Times  
Wednesday, May 22, 2013

 
GO
 
 
Viewing The World Through The Eyes Of Wikipedia
 
Home >> Infotech >> Trends
 
Viewing The World Through The Eyes Of Wikipedia  
 
   
Rate this news:  (0 Votes)
Thursday, June 21, 2012 SGI (NASDAQ: SGI), the trusted leader in technical computing has partnered with Kalev H. Leetaru of the University of Illinois to create the first-ever historical mapping and exploration of the full text contents of the English-language edition of Wikipedia, in time and space. The results include visualizations of modern history captured in under a day utilizing in-memory data-mining techniques. Loading the entire English language edition of Wikipedia into SGI® UV™ 2000, Mr. Leetaru was able to show how Wikipedia’s view of the world unfolded over the past two centuries. Location, year and the positive or negative sentiment have been tied to those references.






While several previous projects have mapped Wikipedia entries with manually assigned location metadata by an editor, these previous attempts only accounted for a tiny fraction of Wikipedia’s location information. This project unlocked the contents of the articles themselves, identifying every location and date in all four million pages and the connections among them to create a massive network.

“Seeing” Wikipedia in a brand new way

“This analysis allows the world to take a step back from the individual articles and text to gain a forest view of the tremendous knowledge captured in Wikipedia, not just a page by page tree view. We can watch how one of the largest collections of human knowledge has evolved and see what we could never see before, such as global sentiment at a certain time and place, or where there might be blind spots in the knowledge coverage, ” said Franz Aman, chief marketing officer and head of strategy, SGI. “We love to use Google Earth because we can zoom out and get the big picture view. With SGI UV 2, we can apply the same concept to Big Data to get the big picture on our Big Data.”

From this analysis, Wikipedia is seen to have four periods of growth in its historical coverage: 1001-1500 (Middle Ages), 1501-1729 (Early Modern Period), 1730-2003 (Age of Enlightenment), 2004-2011 (Wikipedia Era) and its continued growth appears to be focused on enhancing its coverage of historical events, rather than increased documenting of the present. The average tone of Wikipedia’s coverage of each year closely matches major global events, with the most negative period in the last 1,000 years being the American Civil War, followed by World War II. The analysis also shows that the “copyright gap” that blanks out most of the twentieth century in digitized print collections is not a problem with Wikipedia where there is steady exponential growth in its coverage from 1924 to today.

Enabling researchers to data-mine Big Data at the speed of Big Data

“The one-way nature of connections in Wikipedia, the lack of links, and the uneven distribution of Infoboxes, all point to the limitations of metadata-based data mining of collections like Wikipedia,” said Mr. Leetaru. “With SGI UV 2, the large shared memory available allowed me to ask questions of the entire dataset in near-real time. With a huge amount of cache-coherent shared memory at my fingertips, I could simply write a few lines of code and run it across the entire dataset, asking whatever questions came to mind. This isn’t possible with a scale-out computing approach. It’s very similar to using a word processor instead of using a typewriter – I can conduct my research in a completely different way, focusing on the outcomes, not the algorithms.”

The analytical approach

Loaded into SGI® UV™ 2000, the Big Brain computer, this massive dataset underwent full text geocoding and complete date-coding, using algorithms that identified every mention of every location and every date across the text of every entry on Wikipedia. More than 80 million locations and 42 million dates between 1000 AD and 2012 were extracted, averaging 19 locations and 11 dates per article (every 44 words and every 75 words, respectively). The connections between every date and every location were captured into a massive network representing Wikipedia’s view of history. With this instrumentation, Mr. Leetaru was able to perform near-real time analysis over the entire dataset on the SGI UV 2 to create visual maps throughout space and time to see not only how history unfolded but also the overall tone of the world throughout the last thousand years, and interactively testing a wide array of theories and research questions, all in less than a day’s work.

The New SGI UV: The Big Brain computer

SGI UV 2 product family enables users to find answers to the world’s most difficult problems on a system as easy to administer as a workstation. Built with Intel® Xeon® processor E5 family, running standard Linux, and supporting a wide range of storage options, SGI UV 2 offers a complete, industry-standard solution for no-limit computing.

With as little as 16 cores and 32 gigabytes of memory, SGI UV 2 can start small and seamlessly expand. This next generation platform doubles the number of cores (up to 4096 cores) and quadruples the amount of coherent main memory (up to 64 terabytes) from the previous generation, available for in-memory computing in a single-image system. SGI UV 2 can scale to eight petabytes of shared memory and at a peak I/O rate of four terabytes per second (14 PB/hour) it could ingest the entire contents of the U.S. Library of Congress print collection in less than three seconds.

SGI UV 2000 is available immediately. SGI UV 20 can be ordered today and will start shipping in August 2012. Pricing starts at $30,000 USD.



Print Email Post Comment 
(Total Views: 471)
 
Share
 
 
Infotech News
   
Two Basic Steps To Increase PC's Speed
Send Scented Messages Using Scentee Smartphone Addon
Opera For Android Out Of Beta; Now Available On Google Play
'Desi' Facebook And Twitter Coming Soon?
5 Top And Free Image Hosting Websites
 
 
 
     
     
     
Press Release
     
DISH Anywhere App Upgraded, Includes On ...
Powermat And PowerKiss To Unite
Mosaik Solutions Launches CellMaps ...
Major League Soccer And Windows 8 Bring ...
TiVo Reports Results For The First ...
LXI For Collider Signal Monitoring At ...
Mobile Operators: Make Cellular And ...
Times Mobile Ltd Brings A Home ...
Nearly 3,000 Participants Attended ...
Tech Mahindra Q4 PAT At Rs 377 Crores, ...
Recommend.ly The Easiest Way To Gain ...
Mahindra Racing Launches Android-Based ...
Tata Communications’: 40th Anniversary ...
Jelastic Launches New Version Of Its ...
Soyer5001T: Put The Zing Back In Your ...
Tata DOCOMO Inks Exclusive Partnership ...
F5 Addresses The Escalating Application ...
Romanian Teenager Wins Big For ...
CMC Wins TV5 Business Leader Award In ...
SP/Silicon Power Presents Jewel J10 USB ...
Yebhi.com Launches 30 Virtual Stores ...
Amdocs Announces Cloud-based Business ...
Record Number Of New Exhibitors Join ...
His Excellency Premier Li Keqiang ...
Jogesh K. Jaitly Moves From Samsung To ...
 
Ericsson Brings Carrier-Grade Wi-Fi To ...
Axis Announces A High-Performance Video ...
Achieve Cost Savings With SapphireIMS ...
0% 6 And 12 Month EMI On Samsung ...
Seagate Delivers Industry’s First ...
Xilinx Achieves PCI Express Compliance ...
Plancess Partners With LurnQ Taking The ...
Vuclip Redefines Mobile Ad ...
EMC Components: Extremely Miniaturized ...
Snapdeal.com Exclusively Launches A ...
Amdocs Unveils Industry’s First Elastic ...
NASSCOM Announces Engineering Council ...
MAIT Felicitates Uttarakhand Government ...
Sumitomo Corporation And NEC Provide ...
Gionee Announces “ELife” Its Ultra ...
Discover Browser Beauty With Opera for ...
Dijit Media Introduces NextGuide Web, A ...
Wacom Offers Bamboo Loop - A Digital ...
SPOT Global Phone Brings Affordable, ...
Enjoy The Best Moments From The UEFA ...
Evolio Launches The Thinnest And ...
DIGISOL Rolls Out “Cool Summer Offer”, ...
One Percent Rise In Use Of Properly ...
Toshiba To Start Mass Production Of ...
Marvell Unveils Industry's First Mass ...
     
     
     
     
     
Most popular
 
 
 
 
Features
Four Best And Free Cloud Storages With Their Features
To make it easy for you to choose the best cloud storage option, we bring the top 4 cloud storages with their features....
Five Free Google Reader Alternatives
The Google Reader might be dying on 1 July but the RSS is definitely not! So here we bring to you 5 alternatives of Google Reader....
 
  View All
Videos
 
First Look: LG Optimus G
The phone sports a high-end display and comes powered by a powerful processor. ...
Create QR-Codes For Free
TEC-IT releases the freeware QR-Code Studio to provide a quick and convenient way of QR code creation for every application scenario....
DoT Secretary Shares Plans For Growth Of Telecom Sector
M.F. Farooqui has recently taken charge as secretary, Department of Telecom....
Hands-On: Sony Xperia Z
Xperia Z is Sony's first entrant model in the big-screen smartphone category. ...
Hands On: Videocon A30 Smartphone
Videocon, the consumer electronics company which is known for its refrigerators, washing machine and air-conditioner has unveiled its Android-based sm...
   
View All
   
 
Dialogue
 
“Open Source Technology Will Bring In A Services-Based Model With A Reasonable Opex, Zero Capex”
myOpenSourceStore.com is an open source solutions provider catering to businesses worldwide. ...
How OSS Helped A Construction Company Almost Halve Its IT Budget!
SEW Infra has been able to save nearly 40 per cent of its IT budget by deploying open source solutions....
Face To Face With Richard Stallman
The father of the free software movement, Richard M. Stallman talks on topics including why ‘Free Software’ matters so much, the entire confusion crea...
“We See India As Our Top Priority And Believe It To Be A Fascinating Market”
In an exclusive interview with EFY, Yamashita talks about the potential market in India, and Fujitsu’s marketing strategy to explore it....
Indian Market Is A Quality Conscious Market And The Customers Pay The Price For Quality
In an exclusive interview with EFY, Hidekazu Katsuno, president, ROHM Semiconductor Singapore Pte Ltd, talks about the company's strategy to capture t...
   
  View All
CeBIT 2013
 
Major Indian IT Companies Found Missing From CeBIT
Besides European companies, CeBIT 2013 attracted exhibitors and visitors in large numbers from all other continents as well. Poland was the partner co...
CeBIT 2013: Here Comes Brain Painting!
The system is basically a computer program that can help paralysed patients draw artworks simply by using the power of their brains. ...
CeBIT 2013: Fujitsu Unveils Lifebook E Line Notebooks
All three models in the series include flexible and convenient working functions that are normally expected in today’s premium business notebooks. ...
CeBIT 2013: Want To Feel Loved? Get 'Cuddle Jacket' For You
The 'cuddle jacket' can be helpful for kids suffering from autism and other sensory disorders....
CeBIT 2013: Here Comes Solar Powered Water Filtering Technology
The technology works in a unique way as it purifies water with the help of UV rays coming via daylight....
CeBIT 2013: Highlights Of Day 1!
Besides European companies, CeBIT 2013 attracted exhibitors and visitors from all other continents....
   
View All
   
 
Events
 
12 Nov: LASER World Of PHOTONICS INDIA

View All
   
   
 
 

home archives contact us advertise with us
           
Magazines Portals Directories Events News Verticals Educational Institute  
Electronics for You
Open Source for You
Facts for You
Electronics Bazaar
electronicsforu.com
efytimes.com
bpotimes.com
linuxforu.com
Electronics Annual Guide
EFY EXPO
EFY Awards
EduTech Expo
OSIWEEK Expo
Electronics
Infotech
Linux & Open Source
Consumer Electronics
Science & Technology
BPO
EFY Techcenter 
 
 
© Copyright 2013 EFY Enterprises Pvt. Ltd.
All rights reserved. Reproduction in whole or in part in any form or medium without written permission is prohibited.
Usage of the content from the web site is subject to Terms and Conditions