
Blog Feed Post

R among TechCrunch's 5 Trendy Open-Source Techs for Big Data

Tim Gasper (Product Manager at Big Data platform Infochimps) has an informative article at TechCrunch that provides an overview of five open-source technologies now trending for Big Data applications. They are:

Storm and Kafka (for processing stream data)
Drill and Dremel (for ad-hoc queries of big data)
R (for data science with big data)
Gremlin and Giraph (for graph analysis, e.g. of social networks)
SAP HANA (for in-memory analytics). HANA isn't an open-source tool though, so perhaps the fifth slot should really go to ...

Honourable mention: D3, for web-based visualization.

Regarding R, Gasper says that it is "incredibly powerful", "the new standard for statistics", and that "the R community is one of the most thrilling places to be in Big Data right now". He also mentions the RHadoop project ("R work very well with Hadoop") and the up-and-coming Julia project. You can read Gasper's complete overview of R and the other trending big-data technologies at the link below.

TechCrunch: Big Data Right Now: Five Trendy Open Source Technologies

Read the original blog entry...

More Stories By David Smith

David Smith is Vice President of Marketing and Community at Revolution Analytics. He has a long history with the R and statistics communities. After graduating with a degree in Statistics from the University of Adelaide, South Australia, he spent four years researching statistical methodology at Lancaster University in the United Kingdom, where he also developed a number of packages for the S-PLUS statistical modeling environment. He continued his association with S-PLUS at Insightful (now TIBCO Spotfire), overseeing the product management of S-PLUS and other statistical and data mining products.

David Smith is the co-author (with Bill Venables) of the popular tutorial manual, An Introduction to R, and one of the originating developers of the ESS: Emacs Speaks Statistics project. Today, he leads marketing for REvolution R, supports R communities worldwide, and is responsible for the Revolutions blog. Prior to joining Revolution Analytics, he served as vice president of product management at Zynchros, Inc. Follow him on Twitter at @RevoDavid.