Top LinkedIn Groups in 2014 for Analytics, Big Data, Data Mining, and Data Science

We analyze the Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science. Overall activity drops about 25%, but membership growth accelerates starting in Q4 2013. We identify 4 group quadrants and find which groups are fastest growing and most active.

By Gregory Piatetsky, Apr 14, 2014. 

We update our analysis of the Top 30 LinkedIn Groups for Analytics, Big Data, Data Mining, and Data Science (Dec 2013) and find several interesting trends. 

First, we found that growth slowed down in 2013 Q3 but resumed in 2013 Q4 and 2014 Q1. 

Figure 1 (below) shows quarterly growth rates in the top 30 groups. Two groups, Machine Learning and SAS & Analytics Users (not shown in Figure 1), had big growth in 1 or 2 quarters and none in the other 2 quarters. Most other groups show a surprisingly similar pattern: a decline in growth in 13Q3, followed by acceleration in 13Q4 and 14Q1. 

Fig 1: Top Linked Analytics Groups, Quarterly Growth 2013Q2 to 2014Q1. Thick black line is the overall average growth rate. 

Here are the 10 largest groups (by membership as of March 31, 2014). The 7 largest are in the same order as in Nov 2013. The 6 largest grew significantly faster than the next 4 groups.
  • Advanced Business Analytics, Data Mining and Predictive Modeling: 121,816 (74% growth in 12 months)
  • Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 95,638 (82% growth)
  • Big Data and Analytics: 74,350 (100%)
  • Business Analytics: 53,345 (43%)
  • Data Mining, Statistics, Big Data, and Data Visualization: 43,761 (116%)
  • BIG DATA Professionals - Architects Scientists Analytics Experts: 30,792 (92%)
  • Next Gen Market Research (NGMR): 23,368 (15%)
  • SAS Analytics & BI (closed): 20,941 (32%)
  • Business Intelligence & Analytics Group: 20,000 (4%)
  • Global Analytics Network: 19,389 (11%)

 
However, there seems to be no strong correlation between group size and growth rate among all 30 groups. 

Here are the 10 groups with the fastest growth in the past 12 months (March 25, 2013 to March 31, 2014):
  • RDataMining: 126%
  • Data Mining, Statistics, Big Data, and Data Visualization: 116%
  • Data Scientists: 114%
  • Big Data and Analytics: 100%
  • BIG DATA Professionals - Architects Scientists Analytics Experts: 92.5%
  • Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group: 82%
  • Advanced Business Analytics, Data Mining and Predictive Modeling: 74%
  • KDnuggets Analytics, Data Mining, and Data Science: 73%
  • Predictive Analytics Network (PAN): 72%
  • Advanced Analytics (closed): 66%

 
The chart below shows group growth vs group size. Color corresponds to age - redder is younger, bluer is older. Group name abbreviations are in the table below. 

Fig 2: Top Linked Analytics, Big Data, Data Science Groups by 2014 size vs growth 

There are 2 main measures of group activity: discussions (posts) per week and comments per week. Since these numbers clearly depend on the group size, we measure them per 1000 members. We measure overall group activity as (discussions + comments) per week per 1000 members. 

For 4 months ending in March 2014, activity level was 2.99/week, about 25% less than 3.97/week measured in Nov 2013. 
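
As a rough illustration of this measure, the Python sketch below computes activity per 1000 members and the relative drop between the two periods. The function and variable names are ours, chosen only for illustration, and the 5,000-member group is a made-up example; the other numbers are the averages reported in this post.

```python
def activity_per_1k(discussions_per_week, comments_per_week, members):
    """Overall activity: (discussions + comments) per week per 1000 members."""
    return 1000 * (discussions_per_week + comments_per_week) / members


# Hypothetical group: 5,000 members, 10 discussions and 5 comments per week.
example_activity = activity_per_1k(10, 5, 5000)

# Average per-1K rates reported in this post (discussions + comments).
activity_2014 = 2.03 + 0.96   # 4 months ending in March 2014
activity_2013 = 2.57 + 1.40   # measured in Nov 2013

# Relative drop from the Nov 2013 level to the March 2014 level.
relative_drop = (activity_2013 - activity_2014) / activity_2013
print(f"{activity_2014:.2f} vs {activity_2013:.2f}: drop of {relative_drop:.1%}")
```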

The chart below shows group activity vs group size. Color corresponds to age - redder is younger, bluer is older. Group name abbreviations are in the table below. 

Fig 3: Top Linked Analytics, Big Data, Data Science Groups - 2014 Activity vs Growth 

In the 4 months ending in March 2014, the average activity level was 2.03 discussions/week per 1K members and 0.96 comments/week per 1K members, or about 2.1 discussions per comment. Both rates are well below the 2.57 discussions/week per 1K members and 1.40 comments/week per 1K members measured in Nov 2013 (1.8 discussions per comment). This means that while activity has slowed down, the gap between discussions and comments has increased. 
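
The discussions-per-comment ratios can be checked with a few lines of Python. This is just the arithmetic from the paragraph above; the helper name `discussions_per_comment` is ours.

```python
def discussions_per_comment(discussions_per_1k, comments_per_1k):
    """Ratio of weekly discussions to weekly comments (both per 1K members)."""
    return discussions_per_1k / comments_per_1k


# Average weekly rates per 1K members, as reported in this post.
ratio_nov_2013 = discussions_per_comment(2.57, 1.40)
ratio_mar_2014 = discussions_per_comment(2.03, 0.96)

print(f"Nov 2013: {ratio_nov_2013:.1f} discussions per comment")
print(f"Mar 2014: {ratio_mar_2014:.1f} discussions per comment")
```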

The chart below shows average comments/week vs average discussions/week for all 30 groups, with circle size proportional to group size and circle color corresponding to activity change: green means an increase, red a decrease. We also show median lines for each dimension, which can be used to divide the groups into 4 quadrants. 

Fig 4: 4 Quadrants of Top Linked Analytics, Big Data, Data Science Groups: Commenting vs Posting 

Several groups stand out: KDnuggets has the highest number of discussions per 1000 members, while RDM has the highest number of comments. The median lines divide the groups into 4 quadrants, which we can characterize as:
  • "Engaged" (above median on both comments and discussions): KDnuggets, Dscientists, PAN, DM Stat, RDM, Big Data & A, Adv BADM
  • "Posting" (above median on discussions, below median on comments): DSC, Global A
  • "Commenting" (below median on discussions, above median on comments): NGMR, RMA/RMDS, PR
  • "Passive" (below median on both comments and discussions)
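
The quadrant assignment can be sketched in Python as follows. This is only an illustration: the `quadrant` helper and the `ACTIVITY` dictionary are names we made up, the numbers are the per-1K comment and discussion rates from the table in this post, and we assume that a group sitting exactly on a median line counts as below it.

```python
from statistics import median

# Abbreviation -> (comments/week, discussions/week), both per 1K members.
ACTIVITY = {
    "Adv BADM": (1.86, 1.69), "Big Data ASFSSP": (0.76, 1.54),
    "Big Data & A": (1.97, 1.86), "Biz Analytics": (0.71, 0.62),
    "DM Stat": (1.92, 3.24), "BD Prof": (0.84, 3.31),
    "NGMR": (2.13, 1.28), "SAS A&BI": (0.48, 0.50),
    "BI&A": (0.21, 1.48), "Global A": (0.22, 1.91),
    "ML Conn": (0.78, 0.53), "PR": (1.06, 0.19),
    "SAS Users": (0.22, 1.20), "Actuary": (0.17, 0.25),
    "RMDS": (1.05, 1.29), "Text A": (0.76, 1.04),
    "DSC": (0.64, 3.45), "Adv A": (0.98, 2.51),
    "Visual": (0.81, 1.80), "D&TA Prof": (0.32, 1.35),
    "PAN": (0.90, 5.79), "Adv AP": (1.06, 0.12),
    "Dscientists": (2.18, 6.72), "Lavastorm": (0.08, 0.66),
    "RDM": (2.88, 2.63), "KDnuggets": (2.57, 9.68),
    "DMT": (0.19, 1.43), "PMML": (0.32, 0.23),
    "Healthcare": (0.63, 0.75), "BI Tools": (0.25, 1.81),
}

median_comments = median(c for c, _ in ACTIVITY.values())
median_discussions = median(d for _, d in ACTIVITY.values())


def quadrant(comments, discussions):
    """Name the quadrant for one group, relative to the two median lines."""
    above_comments = comments > median_comments
    above_discussions = discussions > median_discussions
    if above_comments and above_discussions:
        return "Engaged"
    if above_discussions:
        return "Posting"
    if above_comments:
        return "Commenting"
    return "Passive"


quadrants = {name: quadrant(c, d) for name, (c, d) in ACTIVITY.items()}
for name in ("KDnuggets", "DSC", "NGMR", "Lavastorm"):
    print(name, "->", quadrants[name])
```

With the table values, the medians come out at about 0.77 comments and 1.455 discussions per week per 1K members.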

 
The details are in the table below, with groups ordered by the number of members. The link to the raw data is at the end of the post. 

The growth, comments, and discussions values are in green font if the value is 25% above average, in red if 25% below average, and in black otherwise. 
We note that there are only 4 "triple green" groups that are significantly above average on growth, comments, and discussions:
  • Data Mining, Statistics, Big Data, and Data Visualization
  • Data Scientists
  • RDataMining
  • KDnuggets Analytics, Data Mining, and Data Science
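
The coloring rule and the "triple green" test can be sketched in Python. We assume here that "25% above average" means a value greater than 1.25 times the average (and "25% below" means less than 0.75 times the average); the `color` and `is_triple_green` helpers are our own names, and only a few rows from the table are included.

```python
# "Average" row of the table: growth (%), comments and discussions per week per 1K members.
AVERAGE = {"growth": 53, "comments": 0.96, "discussions": 2.03}

# A few rows copied from the table, not all 30 groups.
GROUPS = {
    "DM Stat":      {"growth": 116, "comments": 1.92, "discussions": 3.24},
    "Dscientists":  {"growth": 114, "comments": 2.18, "discussions": 6.72},
    "RDM":          {"growth": 126, "comments": 2.88, "discussions": 2.63},
    "KDnuggets":    {"growth": 73,  "comments": 2.57, "discussions": 9.68},
    "Big Data & A": {"growth": 100, "comments": 1.97, "discussions": 1.86},
    "Adv BADM":     {"growth": 74,  "comments": 1.86, "discussions": 1.69},
    "PAN":          {"growth": 72,  "comments": 0.90, "discussions": 5.79},
}


def color(value, average, band=0.25):
    """Assumed rule: green above average*(1+band), red below average*(1-band)."""
    if value > average * (1 + band):
        return "green"
    if value < average * (1 - band):
        return "red"
    return "black"


def is_triple_green(stats):
    """True if growth, comments, and discussions are all green."""
    return all(color(stats[key], AVERAGE[key]) == "green" for key in AVERAGE)


triple_green = [name for name, stats in GROUPS.items() if is_triple_green(stats)]
print(triple_green)
```

Under this reading of the rule, Big Data & A and Adv BADM miss on discussions, and PAN misses on comments.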

 
| Abbrev | LinkedIn Group | Members (Mar 31, 2014) | Founded | 12 mon Growth, annualized | Cmt/week per 1K mbr | Disc/week per 1K mbr |
|---|---|---|---|---|---|---|
| Average | | 23058 | 22-Dec-08 | 53% | 0.96 | 2.03 |
| Adv BADM | Advanced Business Analytics, Data Mining and Predictive Modeling | 121816 | 28-Sep-07 | 74% | 1.86 | 1.69 |
| Big Data ASFSSP | Big Data / Analytics / Strategy / FP&A / S&OP / Strategic Planning / Predictive & Business Analytics Group | 95638 | 20-Feb-09 | 82% | 0.76 | 1.54 |
| Big Data & A | Big Data and Analytics | 74350 | 1-Mar-12 | 100% | 1.97 | 1.86 |
| Biz Analytics | Business Analytics | 53345 | 3-Mar-08 | 43% | 0.71 | 0.62 |
| DM Stat | Data Mining, Statistics, Big Data, and Data Visualization | 43761 | 25-Jul-08 | 116% | 1.92 | 3.24 |
| BD Prof | BIG DATA Professionals - Architects Scientists Analytics Experts | 30792 | 1-Sep-08 | 92% | 0.84 | 3.31 |
| NGMR | Next Gen Market Research (NGMR) | 23368 | 26-Sep-07 | 15% | 2.13 | 1.28 |
| SAS A&BI | SAS Analytics & BI (closed) | 20941 | 25-Jun-08 | 32% | 0.48 | 0.50 |
| BI&A | Business Intelligence & Analytics Group | 20000 | 6-Jan-08 | 4% | 0.21 | 1.48 |
| Global A | Global Analytics Network | 19389 | 23-May-08 | 11% | 0.22 | 1.91 |
| ML Conn | Machine Learning Connection (closed) | 19087 | 12-Mar-08 | 52% | 0.78 | 0.53 |
| PR | Pattern Recognition, Data Mining, Machine Intelligence (closed) | 16297 | 2-Oct-08 | 60% | 1.06 | 0.19 |
| SAS Users | SAS & Analytics Users | 15121 | 13-Apr-08 | 34% | 0.22 | 1.20 |
| Actuary | Actuary / Actuarial, Predictive Modeling, Data Mining, and Statistics News / Jobs / Careers Group | 14930 | 24-Sep-08 | 28% | 0.17 | 0.25 |
| RMDS | Research Methods and Data Science (former RMA) | 14929 | 10-Apr-09 | 35% | 1.05 | 1.29 |
| Text A | Text Analytics | 13947 | 2-Jun-08 | 27% | 0.76 | 1.04 |
| DSC | Data Science Central | 13112 | 10-Feb-12 | 48% | 0.64 | 3.45 |
| Adv A | Advanced Analytics (closed) | 12025 | 11-Jan-09 | 66% | 0.98 | 2.51 |
| Visual | Visual Analytics | 8450 | 31-Mar-08 | 50% | 0.81 | 1.80 |
| D&TA Prof | Data & Text Analytics Professionals | 8003 | 24-Sep-07 | 22% | 0.32 | 1.35 |
| PAN | Predictive Analytics Network (PAN) | 7623 | 16-Mar-09 | 72% | 0.90 | 5.79 |
| Adv AP | Advanced Analytics, Predictive Modeling & Statistical Analyses (closed) | 7278 | 10-Jul-08 | 41% | 1.06 | 0.12 |
| Dscientists | Data Scientists | 7052 | 8-Jun-09 | 114% | 2.18 | 6.72 |
| Lavastorm | Lavastorm Analytics Community Group | 6349 | 17-Apr-11 | 22% | 0.08 | 0.66 |
| RDM | RDataMining | 4972 | 30-Aug-11 | 126% | 2.88 | 2.63 |
| KDnuggets | KDnuggets Analytics, Data Mining, and Data Science | 4886 | 4-Feb-08 | 73% | 2.57 | 9.68 |
| DMT | Data Mining Technology (closed) | 4034 | 20-Jun-08 | 33% | 0.19 | 1.43 |
| PMML | Predictive Model Markup Language (PMML) | 3596 | 24-Sep-09 | 18% | 0.32 | 0.23 |
| Healthcare | Healthcare Data Mining and Modeling | 3594 | 11-Jul-08 | 63% | 0.63 | 0.75 |
| BI Tools | Business Intelligence Tools | 3044 | 2-Jul-08 | 42% | 0.25 | 1.81 |


Note: You can get actual data from the HTML source code of the LinkedIn group Statistics/Activity page. 

Look for dataset seriesName="Comments" and parse that data. Likewise for Discussions and Members. 
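
A minimal Python sketch of that parsing step is below. The exact markup of the LinkedIn statistics page is not shown in this post, so the layout here is an assumption: we guess that each series is a `<dataset seriesName="...">` element with `<set value="..."/>` children, and the `SAMPLE_HTML` snippet is invented for illustration. The `extract_series` helper is our own name, and the pattern would need adjusting to match the real page source.

```python
import re

# Invented sample; the real page source may be structured differently.
SAMPLE_HTML = """
<dataset seriesName="Comments"><set value="12"/><set value="7"/></dataset>
<dataset seriesName="Discussions"><set value="30"/><set value="25"/></dataset>
"""


def extract_series(html, series_name):
    """Return the numeric values of one named dataset, or [] if it is absent."""
    # Assumed layout: <dataset seriesName="NAME"> ... <set value="N"/> ... </dataset>
    block_pattern = (
        rf'<dataset[^>]*seriesName="{re.escape(series_name)}"[^>]*>(.*?)</dataset>'
    )
    block = re.search(block_pattern, html, flags=re.DOTALL | re.IGNORECASE)
    if block is None:
        return []
    values = re.findall(r'<set[^>]*\bvalue="([^"]*)"', block.group(1))
    return [float(v) for v in values if v]


comments = extract_series(SAMPLE_HTML, "Comments")
discussions = extract_series(SAMPLE_HTML, "Discussions")
members = extract_series(SAMPLE_HTML, "Members")  # not in the sample, so empty
print(comments, discussions, members)
```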

Thanks to Anmol Rajpurohit for collecting the membership, comments, and discussions data. 

Here is raw data (csv) for the top 30 LinkedIn groups.
