Skip to main content
All Posts By

R Consortium

Osun R User Group in Nigeria talks about Spreading the Gospel of R across Africa

By Blog

The R consortium spoke with Timothy Ogunleye on adapting to fit the system during the lockdown. We found out how they were able to adjust and move from having so many in-person meetups to virtual sessions largely through WhatsApp.

Timothy holds a Master of Science (M.Sc.) and Bachelor of Science (B.Sc. – Hons.) in Statistics with 2nd Class Upper Division from the University of Ilorin in Nigeria. Currently, he is close to finishing his doctorate degree under the supervision of Prof. A.O. Adejumo. 

Timothy has also trained at the Department of Computing of Macquarie University, Sydney, Australia, and has been certified as an R and Python Programmer (expert) in the field of Data Science. And he has obtained a national diploma (ND) and higher national diploma (HND) certification in Statistics from the Federal Polytechnic in Osun State in Nigeria.

Tim, as he is proudly called by his friends and colleagues, has at least 15 years of experience in both industry and academia. He has been a monitoring and evaluation (M&E) consultant, data analyst, and epidemiology and health consultant with more than 40 reports produced. He has worked with many local and international NGOs, including the United Nations. In addition, he has published at least 13 journal articles both locally and internationally. Tim is a Member of the Science Association of Nigeria (SAN), the Nigerian Statistical Association (NSA), the Nigerian Association of Mathematical Physics (NAMP); and the International Association for Statistical Computing (IASC).

What is the R community like in Nigeria?

In Nigeria currently, there are a lot of R users and user groups. We have preached the gospel of R to over 2000 people all over Nigeria. Our R symposiums were always packed with 150 – 200 attendees. Our R meetups were held in the Osun state, here in Nigeria. The R user group in Osun has received grants from the R consortium and also the Society of Research Software Engineers (SRSE), IESC. We have conducted a few symposiums using the universities in Osun.

In the past year, did you have to change your techniques to connect and collaborate with members? For example, did you use GitHub, video conferencing, online discussion groups more? Can these techniques be used to make your group more inclusive to people that are unable to attend physical events in the future?  

During COVID19 here in Nigeria, we used Telegram and WhatsApp groups to connect with members. Every Sunday at 9 pm WAT, I would host an online session to teach R for 1 hour. We used the Zoom platform for video recordings and teachings. The classes were recorded with Zoom and shared across Telegram and WhatsApp for more engagement. Whatsapp is used to communicate updates. This is still ongoing, and we don’t take money. 

Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting? 

As the team leader of the Osun User group, I am assisted by four other organizers who can also speak at our meetups/sessions. This means there is a lot of variety in the people who speak. One of my favorite presentations was “Data graphics using the Plotly command.” I am especially interested in the Plotly application for graphics. It can do very good pie charts.

What trends do you see in R language affecting your organization over the next year?

The team members are committed and passionate, there is the need for spreading the R programming language. The goal is to teach the R language from secondary schools. It is important to do this as technology is spreading and the need for Data scientists increases. 

R could be introduced early to assist students and help push the R programming language. We want to introduce programming to the Government, so it can be included in the syllabus. The challenge is some high school students do not have laptops for learning.

Do you know of any data journalism efforts by your members? If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

Some of my group members have worked with newspaper publications. For a symposium, we needed to have some paid publicity. We got it for free and in return, we wanted to do something for the publication; this was where I started to look at how R can be used in journalism. There was data for different sectors, income, and expenditure, for Nigeria alone. I used R to get graphics and percentages to build data analysis. Since then, I don’t think I’ve worked on a data journalism project.

When is your next event? Please give details!

My user group and I have contacted R experts from other African countries in Kenya, Sudan, and Zimbabwe. We’re planning to conduct data science and machine learning workshops using the R language for Africans. More information cannot be given currently since the idea is still in planning. We are looking to send a grant request to the ISC at the R consortium to help fund this idea. This event is slated for January 2022.

Of the Funded Projects by the R Consortium, do you have a favorite project? Why is it your favorite?

My favorite project is The RECON COVID-19 challenge: leveraging the R community to improve COVID-19 analytics resources. It is a vibrant project that talks about covid 19 and how we can leverage R to fix most of the challenges with COVID19.

Of the Active Working Groups, which is your favorite?  Why is it your favorite?

Well, I’d say my favorite is the R/Business and the R/medicine. They are directly connected to real-world problems.

We are also thinking of starting a working group ourselves. This is still in the pipeline. We need the opinions of other team members to make a final decision, but currently, it’s not something I can say so much about. 

Oslo UseR! Group’s Diverse and Inclusive Environment Has Fostered a Resilient R Community

By Blog

R Consortium talked to Raoul Wolf of the Oslo UseR! Group about the wide adoption of R in Norway, both in academia and industry. He explained how the pandemic initially hindered activities of the group, but they bounced back. The group took the opportunity to collaborate with other R communities and invite high profile international speakers for virtual events.

Raoul currently works as a Senior Advisor and Digital Developer within Sustainable Geosolutions at Norwegian Geotechnical Institute. Hailing from Germany, Raoul was first introduced to R when he began his PhD studies in Norway eight years ago. 

When he’s not coding in R, Raoul enjoys the cultural life in Oslo and is really fond of the fantastic museums and the vibrant culinary scene. 

What is the R community like in Norway?

I cannot talk on behalf of the entire R community in Norway, as we have several hubs of the R community. Oslo, as the capital of Norway, has the largest R community. But the R communities in other cities, e.g. Bergen, Trondheim or Tromsø, are also doing great things. 

What I like about the R community is how very welcoming it is. This is true not only for Oslo, but all over Norway. It is a diverse community in terms of cultural backgrounds, professional backgrounds and identities. My professional background is from academia, and we have a vast base in academia in Oslo, but also people from consultancies and the industrial sectors. 

We are lucky to be in the capital city here and can collaborate with the different communities that are already here and using R.

I need to mention that while I am the main organizer of our meetup group in Oslo, I have two wonderful co-organizers, Bethan Cropp and Henrik Galligani Ræder. At a certain point, both were regulars at our events and wanted to take a more proactive role in the community building here, and that’s a blessing. Between the three of us, we recruit speakers and organize events.

In Oslo, we not only have the UseR! Oslo meetup group, but also a great R Ladies meetup group. Across Norway,  other R Ladies groups are really active as well, and we have had great contact with R Ladies Oslo and co-organized events. So we are really happy with this constellation in Oslo. 

How has COVID affected your ability to connect with members?

Massively. It was an enormous challenge initially, but also presented new opportunities. Before the pandemic, we only had physical meetups. When the pandemic hit Europe in March 2020, we had events planned through the summer with confirmed speakers. We postponed our March event, hoping things would get back to normal in April or May. Luckily, Oslo useR!’s previous organizer Dmytro Perepolkin was able to present virtually on short notice, and subsequently we were able to transfer some of our planned physical events to virtual events.

At the end of 2020, we noticed that more and more meetups across Europe have turned virtual. We connected with a few of those meetup groups, but also reached out to international speakers who we thought would be interesting for our Oslo UseR! group.

In January 2021 we had our first “international” speaker, and it was a success event. We followed this pattern to mix up our local talent with some high-profile international speakers, and this has worked well for us. We have seen an increase in the number of people joining the meetup group and in the number of interactions. Along the way, we also figured out practicalities like moderating a chat box.

There was a steep learning curve, but the numbers went up considerably, and now we have over 1,900 members. It is very encouraging having 200+ people participating in our meetups. We started recording our meetups and put them on YouTube. Everyone can go back and watch the videos if they missed out on an event.

In the past year, did you have to change your techniques to connect and collaborate with members? For example, did you use GitHub, video conferencing, online discussion groups more? Can these techniques be used to make your group more inclusive to people that are unable to attend physical events in the future?  

We used the six months to figure out how we can continue with our meetups. We are lucky enough to have a colleague that is affiliated with the University of Oslo and they, very early on, had a professional Zoom license. At the beginning of the pandemic, it was a bottleneck to get people on a video conference like that, and we were lucky enough to have early access to such a platform. We quickly started reaching out to our local talent pool we were in contact with from before.

We are currently in contact with a tech community building where we plan to host our events once we go back to physical events. Additionally, we are exploring the possibility of hybrid events. Ideally we will have physical meetups with livestreams and recordings. This emphasizes our wish to stay in contact with as many people as possible, and to be more inclusive. 

Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting?  

There were a lot of interesting presentations this year. The one that impressed me the most was the one we had in January, which was our first international meetup.

The speaker was Paul Bürkner from Germany, and he talked about Bayesian Multilevel Modeling and presented his brms package. I have used the package before, and it is always a treat to have developers present their own packages. We got a lot of feedback and interaction in the chat box, and had an excellent discussion throughout.

What trends do you see in R language affecting your organization over the next year?

That’s an interesting question! There are several parallel trends in R depending on your point of view. There is the ongoing trend of tidyverse that continues to develop into more ecosystem, like tidymodels. We get a lot of requests at the meetup for integrating R in a larger environment with databases, APIs and visualization tools. Besides the stereotypical use of R in academia, there is an increasing demand for using R in production and business use cases. Data journalism is also becoming more important, and there are some newspaper houses in Norway that are using R.

I don’t know if it qualifies as “trend”, but the positive spirit of dissemination and inclusion in R and the R community is really making a difference. We all witnessed how important it was during the pandemic, and it will be of equal importance moving forward.  

Do you know of any data journalism efforts by your members? If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

The Norwegian Institute of Public Health has presented some of its COVID research and predictions using R. The modeling and visualizations were done in R. It was one of the biggest impacts locally to see R prominently displayed in the public. 

When is your next event? Please give details!

Our next meetup, “Wrapping Packages in R with {devtools} and Friends”, is scheduled for 16th of December. It will be our annual holiday season meetup, which traditionally is a bit more light-hearted. The first half will be about “wrapping packages,” demonstrating how to make packages in R. Afterwards, we want to have an open discussion with our community about how they have experienced the last year.

For next year, we are really excited about the planned events until spring. There are several interesting talks in the pipeline, and we will announce them as soon as we have finalized all details.  

Of the Funded Projects by the R Consortium, do you have a favorite project? Why is it your favorite?

Two projects, actually. One is the project financing the webchem package. I have used the package in my academic work, and have directly benefited from having it available.

The other is the consolidation of R Ladies groups. It is both unique and beautiful, and demonstrates the idea of R being a welcoming and inclusive environment.

Of the Active Working Groups, which is your favorite? Why is it your favorite?

The R Community Diversity and Inclusion working group undoubtedly one of them. Even though I don’t have a direct connection, I think the R Validation Hub for the pharmaceutical industry is important. To get into regulatory territory is a huge step for any programming language, and I am happy to see that R is moving in this direction. 

Successful R-based Test Package Submitted to FDA

By Blog

The R Consortium is happy to announce that on Nov 22nd, 2021, the R Submissions Working Group successfully submitted an R-based test submission package through the FDA eCTD gateway! The submission package has been received by the FDA staff who were able to reproduce the numerical results.

All submission materials can be found at: https://github.com/RConsortium/submissions-pilot1-to-fda

The pilot 1 test submission,  was an example submission package following eCTD specifications which include a proprietary R package, R scripts for analysis, R-based analysis data reviewer guide, and other required eCTD components. To our knowledge, this is the first publicly available R-based FDA submission package. We hope this submission package and our learnings can serve as a good reference for future R-based regulatory submissions from different sponsors. Additional agency feedback will be shared in future communications.

About the R consortium R submission working group

The R Consortium R Submissions Working Group is focused on improving practices for R-based clinical trial regulatory submissions.

To bring an experimental clinical product to market, electronic submission of data, computer programs, and relevant documentation is required by health authority agencies from different countries. In the past, submissions have been mainly based on the SAS language. 

In recent years, the use of open source languages, especially the R language, has become very popular in the pharmaceutical industry and research institutions. Although the health authorities accept submissions based on open source programming languages, sponsors may be hesitant to conduct submissions using open source languages due to a lack of working examples.

Therefore, the R Consortium R Submissions Working Group aims at providing R-based submission examples and identifying potential gaps during submission of these example packages. All materials, including submission examples and communications, are publicly available on the R consortium Github page: https://github.com/RConsortium

The R consortium R submission working group includes members from more than 10 pharmaceutical companies, as well as regulatory agencies. More details of the working group can be found at: https://rconsortium.github.io/submissions-wg/

The R consortium R submission working group is open to anyone who is interested in joining. If interested, please contact Joseph Rickert at joseph.rickert@rstudio.com

Need to Code a Difficult Pharma Stats Table? The R Tables for Regulatory Submissions (RTRS) Working Group Wants to Know

By Blog

The R Consortium’s R Tables for Regulatory Submissions (RTRS) Working Group has made considerable progress in identifying and working through the issues involved with developing a modern R based framework for creating tables. The goal is to be able to make it easy for statistical programmers working in pharmaceutical companies to find the right R resources for creating any type of table that may be required to support a submission to the FDA or any other regulatory agency. 

Currently there are at least six R packages that have the functionality to support some portion of the panoply of tables that might comprise an essential part of a statistical report. These packages include

  • flextable
  • gt
  • huxtable
  • mmtable2
  • rtables
  • Tplyr

The trick is to establish a framework that makes it easy to use the combined features of relevant R packages to produce any table that is likely to show up in a production environment.  

RTRS wants examples of tables, those that are already part of your standard statistical submissions, and those that you would use if you could make them. 

              A: Drug X     B: Placebo    C: Combination
                (N=134)        (N=134)        (N=132)    
---------------------------------------------------------
AGE                                                      
  n               134            134            132      
  Mean (sd)   33.77 (6.55)   35.43 (7.9)    35.43 (7.72) 
  IQR              11            10              10      
  min - max     21 - 50        21 - 62        20 - 69    
BMRKR2                                                   
  LOW              50            45              40      
  MEDIUM           37            56              42      
  HIGH             47            33              50      

We want to make sure that we understand and explore the entire, conceivable space of production tables and want to see to which extent we are already able to build them in R. So please send us your tables. 

We will take them any way that is easy for you to provide. Open up an issue on our GitHub repository (https://github.com/RConsortium/rtrs-wg/issues) and drop them in. Something computable would be best, but we will take text or even screenshots. If there is some part of your example table that is particularly vexing to create, please point that out. Be careful not to include any proprietary information.

If we can already build your difficult table, we will show you how to do it in R. This is meant to be a benchmarking process to showcase abilities of the packages so far and to reveal current limitations. If it turns out that we cannot build your table with R’s current table making capabilities, then there is a good chance that we will add it to our “To Do” list.

If you think that you would like to become involved with our work, please let us know that, too. Include your email address and we will invite you to the next meeting.

R Skopje in North Macedonia Talks About the Challenges of Virtual Events for Smaller Communities

By Blog

The R-Consortium talked to Novica Nakov of R Skopje about the challenges of managing an R User Group during the pandemic. Novica told us about the budding R community in North Macedonia and how shifting events online has not been useful for this tightly knit community. However, Novica hopes things will return to normal once they resume events in a physical space.

Novica is a long term free software enthusiast and has been working with the free software community in Macedonia since 2001. Originally from a Social Sciences background, Novica initially contributed to the community mainly on policy issues, legal issues and localization. He later learned R-programming during his postgraduate studies in the UK.

Before forming the R Skopje User Group, Novica was also involved in establishing a hacker space Kika in Skopje 10 years ago. This hacker space hosted Linux help forum days, general knowledge sharing events and one day conferences.

What is the R community like in North Macedonia?

North Macedonia is a small country, so it is lagging in certain areas as compared to the US. Before 2018-2019, there were very few Data Scientist or Data Analyst jobs available in the country. However, there has been a general shift in public interest ever since. The R community in Macedonia is still in its early stages.

How has COVID affected your ability to connect with members?

I started the R User Group in 2018, hoping to provide a knowledge-sharing space for people like myself who do not have a technical background but need to use a programming language for data analysis. We formed the User Group within the hacker space, as there was an established infrastructure available.  

Before the pandemic, we were having regular meetings on Tuesdays. These meetings were like hands-on workshops where people helped each other with problems they faced while working with R. We also had presentations where members shared their projects with the group. Government ordered a lockdown in early 2020, and we had to shut down the hacker space. We also had to cancel the lease on the space.

As it was a tight-knit community which thrived on one-to-one interactions, lockdown significantly reduced the quality of communication between the members.

In the past year, did you have to change your techniques to connect and collaborate with members? For example, did you use GitHub, video conferencing, online discussion groups more? Can these techniques be used to make your group more inclusive to people that are unable to attend physical events in the future?  

After the lockdown, we had 2-3 online events that were R-specific in which different speakers presented. Besides that, we also moved all our regular events to Google Meet, but it didn’t work really well. Since it was a small community, people spent most of the time catching up with each other and had very little interest in the actual slides, etc.

We are talking about a really small country and a really small pool of people interested in data science. Just to give you an idea about the scale, there is a group called Data Science Macedonia, where the main organizer is a Macedonian company ScaleFocus. They used to host events at a local university which attracted around 150 people. Companies such as Slice used to present in these events and discussed possibilities of employment, which was interesting for people looking to start working in the data field. 

Our group, on the contrary, consists of people who are simply curious to learn new programming languages, etc. for personal use. I think online meetings are not really good for such a group, because they don’t allow for person-to-person interactions for solving a particular problem. 

We attempted having online workshops, but it didn’t work out as it required a lot of troubleshooting, which is really difficult unless you have physical access to the participant’s system. I think online meetings are not great for workshops. 

We started talking about reopening the space, as many people are missing the ability to get together. I don’t know if this will happen, but if it does, we will resume our events in the physical space. But I am not very optimistic about the online events. 

Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting? 

I think the last presentation we had was by a Financial Data Analyst, Ljupcho Naumov, and the topic was optimizing your portfolio using R. This event turned out to be a huge success and around 300 people joined the online event. 

I am not sure if people joined the event to learn R or to get free financial advice about getting rich. Another reason for the success of the event was that it coincided with the central bank lifting the ban on personal accounts for trading in foreign brokerage companies.

What trends do you see in R language affecting your organization over the next year?

Given the situation, I don’t think R or development in any language would be as important as dealing with the health crisis.  

Do you know of any data journalism efforts by your members? If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

One media organization in Macedonia sometimes runs data driven stories, and I think they are connected by the European network called Organized Crime and Corruption Reporting Project (OCCRP).

Macedonia has joined this open data initiative recently and one non-government organization is running a project on creating datasets from local governments governance organizations.  

There’s also this project I was working on where we were trying to create local datasets from data from local governments. I guess this is something that can potentially be used in journalism. 

When is your next event? Please give details!

We don’t have a specific event in mind at the moment. However, as a lot of members in our events are women, I am hoping to connect them to R Ladies. R Ladies Belgrade is very active, and maybe we can have some sort of collaboration with them. I am also really interested in opening an R Ladies branch as we have this pleasant history of women getting involved in our space.

R Consortium Funding Delivers New Format for the R Journal

By Blog

By Di Cook, Mitch O’Hara-Wild, H. Sherry Zhang, Stephanie Kobakian, Michael Kane, Catherine Hurley, Simon Urbanek

The R Journal is a primary outlet for publishing research of interest to the R community. It publishes articles related to R packages (detailing the the broader context, implementation details and examples of use), applications, comparisons and benchmarking and reviews. The journal was born in 2009, superseding a long-running newsletter, to provide an outlet documenting advances in statistical computing, discuss issues and research opportunities at the intersection of statistical analysis and software, and to encourage awareness of these advances and opportunities within the community.

Articles have traditionally been written using latex, and outputted to pdf, with code and data to reproduce results provided separately. Sweave emerged later as a commonly used dynamic document system and allowed R code to be embedded directly into document whose output was tex document that included output (including tables and graphs) generated from embedded R code. This was essentially an accepted pre-processing step resulting in a tex conforming to the structure required by the R Journal.

The operations of the R Journal are entirely run using R itself and, to our knowledge, it is the only journal of this kind. Common operations performed by editors including acknowledging submissions, sending review requests, organising reviews, emailing authors about their paper’s status, compiling articles into an issue, and populating the web site are all contained within the rj package, which is available on Github. It’s lovely to contemplate; R is the editorial workflow system for the R Journal.

The journal is produced entirely by volunteers, like much of the R community. It is an ever-changing group of people who get involved because they care about the community and the availability and survival of a publishing outlet for research related to R. Some members are lucky to have the support of their workplace, usually academic, and juggle the responsibilities with teaching, their own research or consulting. The editorial responsibilities include updating editorial management software (with the rj package) as well as managing article submissions.

Inspired by a talk by Yihui Xie in 2018 “Towards An Open-access, Fast, and Reproducible Journal”,and knowing that she would be committing to be an editor of the R Journal from 2019, Di submitted a grant to the R Consortium requesting funds to experiment with changing the format and operations. It was successfully funded at $50,000USD to support the “hiring of an editorial assistant, and to experiment with alternative operations to streamline checking compliance of submitted papers, soliciting, constructing and returning reviews, copy-editing, construction and web delivery of new issues”.

Central to the new format was utilizing Rmarkdown for document writing. Just reviewing Yihui’s slides I now see the wonderful gif describing a latex foundation (slide 1630) document, and I can confirm that the first year as Editor-in-Chief (this year) that working with the the rj functions on the latex files to build an issue was very much like this, one little bump brings the whole system down. Rmarkdown is a much friendlier version of dynamic document building than Sweave, and writes to various outputs, including html, thanks to pandoc. It is much more elegant for graphics – which is dear to my heart. With html format there is also the prospect of including interactive graphics.

We are now happy to announce that there is a new way to write R Journal articles, that deliver them in html. The new web site can be viewed in development version, with a goal to rolling it out as the main site with the December issue this year.

The new web site has these features:

  • revised overview of the journal
  • updated instructions for authors
  • html formatted articles submitted as Rmd, in addition to the pdf format.

There are numerous benefits of the new html formatted articles, which include the ability to include interactive graphics, and that it is a more accessible format for the blind members of the R community. (Jonathan Godfrey is suitably excited!) To get a glimpse of what is possible with interactive graphics, take a look at the article Conversations in Time where you can find embedded, a plot with linked brushing between multiple plots, and a movie.

The way to get started yourself is to use the rjtools. The function create_article() creates a template in your project, containing a variety of key points, and an interactive plot made with ggplot2 and plotly. This package also allows you to check your article, prior to submission, a little like the way the devtools package helps to check your R package prior to submitting to CRAN.

In the interim, the funding has supported numerous developments to the main workhorse package for operations, rj. This includes:

  • A new submission system: amazingly the previous system was running on submission to a wufoo form hosted by the 2013 editor, Hadley Wickham. The new system uses a Google form, and populating into a Google spreadsheet, owned by current editors, with ownership migrating to new editors as needed.
  • pkgdown web site that includes better documentation on operations.
  • New functions like match_keywords() which allows finding reviewers whose expertise matches the the keywords for the article.
  • New functions for paper management such as tracking articles, and finding articles that are slow-moving in the repository.
  • A new issue building system. This is such a wonderful change, and a relief to the huge time commitment previously needed. With the new system it should be possible to bring out more than two issues per year to handle the larger number of articles.
  • A host of new functions to check articles, that have been migrated now to the rjtools package.
  • New functions to manage Associate Editor and reviewer load.
  • A system to manage the new process including Associate Editors in the review workflow.

The funding has also provided intermittent editorial assistance. Initially, editorial assistant converted the long, long author guidelines into a check-list, multiple check-lists. An initial check-list was used for identifying articles that were not conforming to the guidelines when they were submitted. The second check-list is targeted to the article proofing prior to publication. Authors are provided with this list when their paper is accepted and asked to check their paper carefully, which has helped reduce the load from the editors. The first issue of 2021 was also copy-edited by the editorial assistant, quite painstakingly detailed suggested edits as red-inked pdfs sent to the authors to correct. We hope that you can see the language improvement in these articles.

In the near future we hope to develop the journal operations in two directions: a system for authors to track the progress of their paper through the review system, and a web-based system to help editors manage reviews.

To summarise, the funds from the R Consortium have provided the freedom for the R Journal editors to experiment, and dramatically improve the operations.

Eskisehir R Users Group in Turkey Talks About Pushing Boundaries During the Lockdown

By Blog

The R consortium talked to Mustafa CAVUS of the Eskisehir R Users Group (also on Twitter) about handling group activities during the pandemic. Mustafa shared that online events during the pandemic have allowed their group to evolve as a global forum for Turkish R users and hybrid events in the future will help them further increase their reach. 

Mustafa is working as an Assistant Professor at Eskisehir Technical University, Department of Statistics and is also a visiting Postdoc Researcher at Warsaw University of Technology, Faculty of Mathematics and Information Science. His research interests are in the areas of AutoML and Explainable AI.

What is the R community like in Turkey?

There are many R users in academia and industry in Turkey. Many people have become interested in package development and Shiny development beyond being a user. Especially in recent years the interest I have seen in the use of R in several fields in Turkey amazes me. From a statistician’s point of view I can see some interesting examples in urban planning and landscape architecture using geo-spatial data. I can say that there is a really large number of R users and enthusiasts in Turkey.

How has COVID affected your ability to connect with members?

Actually our user group was founded just before the pandemic. When we were planning our first meeting, we were unable to plan a face-to-face event due to the pandemic and we conducted all our activities online. Although this reduced the overall quality of communication, it also enabled us to meet many people outside of Eskisehir. 

In the past year, did you have to change your techniques to connect and collaborate with members? For example, did you use GitHub, video conferencing, online discussion groups more? Can these techniques be used to make your group more inclusive to people that are unable to attend physical events in the future?  

We certainly use these methods e.g. video conferencing through Zoom is used often. I think many events will be planned as hybrids in future as we get better at using these techniques. And we are also using virtual tools to provide easy access to many people all over the world. We produced some content in the pandemic period for our Youtube Channel in Turkish with Uğur Dar, co-organizer of our community..  

Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting? 

As an Eskisehir R user group we had the opportunity to get together with R users from different cities in Turkey and even the world at the events we started. Over the time our group has evolved into a forum that is not just local. 

We turned this harmony which we achieved together with Turkish R users living in different countries into a great environment. 

The online conference we held in April 2021 with the support of Why R? Foundation and, of course, the R consortium, was the largest R conference held in Turkey to date with over 2500 registered and 800 on spot participants. We as an organization committee wrote about the details and our experiences in the “Conference Report of Why R? Turkey 2021 (PDF)” which was published in the latest issue of the R Journal also. 

I think that each presentation at this event was interesting because it appealed to different audiences.

What trends do you see in R language affecting your organization over the next year?

Actually I know that with the conference held in April 2021 many researchers and professionals from industry started to work together by using R. My expectation for the next year is that this interaction will continue to increase, and we will see interesting usage examples of R in several disciplines. 

Do you know of any data journalism efforts by your members? If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

As far as I know, not in our community, but there is a quite interesting project on data journalism in Turkey called VeriPie. A group of undergraduate students from the department of statistics of Middle East Technical University are engaged in up-to-date data journalism practices mostly using R.

When is your next event? Please give details!

Actually, we are working on an online conference that we plan to organize for Turkish R users in the second quarter of 2022. The overwhelming interest in the event we organized last year motivated us to organize a more extensive event. It will consist of lightning talks, regular talks and also invited speakers. Last year our event just consisted of invited speakers. 

Of the Funded Projects by the R Consortium, do you have a favorite project?  Why is it your favorite?

This is a really tough question I think. My favorite is the RECON COVID-19 Challenge. It is a really promising project and actually I had interest in it but I couldn’t find enough time to contribute. 

Of the Active Working Groups, which is your favorite? Why is it your favorite?

I would say R Medicine because I have been working on Medical datasets lately in my postdoc period. 

Birmingham R talks about the difficulties of socializing in an online space

By Blog

One of the difficult parts of running a group in an online space is maintaining social interactions that you would normally foster with in-person meetups. R Consortium talks to Adnan Fiaz about how he is attempting to create those interactions in the online meetups. 

Adnan is Senior Data Scientist at National Grid in Birmingham, England. He is an analytics professional with a passion for mathematics and complex challenges. And outside of work and R, has a keen interest in playing football, cinema and general aviation.

RC: What is the R community like in Birmingham?

AF: I took over about 3 years ago. Before that, there were several meetings, but they slowed down quite a bit. I came in with quite a few new ideas. We started with 1 or 2 meetings every quarter. We had a good rhythm until 2020. It was getting harder to get speakers, but I was able to find them. We were able to have about 20 people attend the meetings and that was quite good for Birmingham. It was a mix of academics from the local university, NHS staff from the area, and scattered R Users that used it in their businesses. We also had new users and people who had been using it for years. It was very diverse.

RC: How has COVID affected your ability to connect with members?

AF: We were struggling in the beginning because we didn’t know what to do. It depends on how you deal with it as an organizer and a community builder. I was leaning on face-to-face contact and others to help me out. Online, it was harder to engage with people and to ask for speakers from the local community. We didn’t have a meetup for 4 or 5 months. In the autumn we had a meetup. Then we didn’t have anything until the winter. At the beginning of this year, we decided to do meetings online and jumped on the bandwagon of the Global R community. We advertised their meetings on the Birmingham R page to give the community something to watch. Then we organized a meetup of our own in between their events. There is a lot less interaction with the members this way. People tend to be less interactive in online meetings and spaces. Since you must put a lot of effort into forcing socialization in these online spaces. I am looking forward to being able to go back in person.

RC: In the past year, did you have to change your techniques to connect and collaborate with members?  For example, did you use GitHub, video conferencing, online discussion groups more?  Can these techniques be used to make your group more inclusive to people that are unable to attend physical events in the future?  

AF:  Slack, Twitter, and Zoom are the technologies that we use mainly. We also have a GitHub page that we use. These allow a lot of people to attend our online meetings.

RC: Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting? 

AF: We had several from the local meetup. We had a presentation/workshop from Birmingham University about mixed effect models by Bodo Winter. He explained mixed effect models from the basics of how they work as well as the more complex models that can be done. Once people had an understanding, they were able to ask more pointed questions. I was surprised because there was more engagement in the second part, mostly because people understood the concepts. There were a lot of questions, and people seemed to take it in a good way.

RC: What trends do you see in R language affecting your organization over the next year?

AF: We will probably see more people branching into how to build different statistical models. In the last year, we saw packages brought into the tidymodels framework and building upon caret and splitting it, and building specific parts. In short, having better support for model building.

RC: Do you know of any data journalism efforts by your members?  If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

AF: I think one that got a lot of attention was the covid visualization by the Financial Times by John Murdoch. They were very informative. He also spent a lot of time discussing how he created them and engaging with everyone on Twitter.

RC: When is your next event? Please give details!

AF: November 18th. We will be meeting in person.

RC: Of the Funded Projects by the R Consortium,  do you have a favorite project?  Why is it your favorite?

AF: The most useful one to me is the R Ladies Project. I have used the materials from that project to start the meetup again as well as tips to increase engagement.

RC: Of the Active Working Groups, which is your favorite?  Why is it your favorite?

AF: I think the most interesting one is the R Certification. I remember when it was first proposed that it would be useful for meetups to give them a framework. We started with small segments before the meetup to start learning R in a beginner’s course. Just 10 minutes before the meetup to warm up. The R Certification would help guide that.

RC: There are four projects that are R Consortium Top Level Projects. If you could add another project to this list for guaranteed funding for 3 years and a voting seat on the ISC, which project would you add? 

The current four projects are:

AF: I saw some work from Heather Turner on the future of R developers. That would be interesting to get more focus on because it would be good if we had more diversity in the core team of R Development.

How do I Join?

R Consortium’s R User Group and Small Conference Support Program (RUGS) provides grants to help R groups around the world organize, share information and support each other. We have given grants over the past 4 years, encompassing over 65,000 members in 35 countries. We would like to include you! Cash grants and meetup.com accounts are awarded based on the intended use of the funds and the amount of money available to distribute.  

Chile’s R Ladies Group is Open and Inclusive

By Blog

To understand what R Ladies in Santiago is like, R Consortium talked to Riva Quiroga about how they are dealing with organizing and meeting during the pandemic. We also discussed how the chapter is becoming more inclusive and helpful to others in Santiago, as well as all of Chile and Latin America as a whole.

RC: What is the R community like in Chile?

The R community in Chile is very active and diverse. We have members who come from different fields and who have very different interests. There are members who have a STEM background, but also a lot of people from social sciences and the humanities. And they work in very different places: academia, industry, public services, ONGs, students, etc. There are currently four active RLadies Chapters, and one R User group. 

The R user group in Chile started in 2012 in Santiago. I think I found out about them around 2015, but never attended an event because I wasn’t sure I would feel welcomed (it was an all-male group and I was just a beginner), and then they stopped organizing meetings. In 2017 they resumed their activities and we, the team that was planning the launch of RLadies Santiago, started attending. They supported us when we were starting our chapter in 2017, by helping us find venues for our first events. Since then, we see each other as collaborators. We have organized joint activities, and organizers and members of both groups have collaborated together in R-related projects (such as packages, courses, etc.)

2017 was the year the R community started growing at a very fast pace here. Chile is a very centralized country, so everything usually happens only in Santiago, the capital city. So it was great to see in the next few years new RLadies chapters in other parts of the country: Valparaíso (2018), Concepción (2019), and Talca (2020).  

RC: How has COVID affected your ability to connect with members?

We were a very active group until October 2019, when we had to stop our activities due to social unrest in the country. Probably because it was a very difficult time for everyone, it didn’t occur to us to organize online events. 

And then came COVID. Working remotely and online events became the “new normal,” so we decided to resume our activities. We saw this as an opportunity for collaboration between all the RLadies chapters. So since March 2020 all our activities have been branded as “RLadies Chile.” All our events are held via Zoom, thanks to the licenced account provided by DataUC. We post our videos on Vimeo and use GitHub to share code and materials. 

Online events have been a great opportunity to make our community grow. We have been able to reach people in cities that currently don’t have an RLadies chapter, and also people who were unable to attend in-person events. 

This means that our “local” community is now bigger than before. It is no longer limited to the four cities that have an RLadies chapter. People from different parts of Chile and Latin America are joining our events, and even spanish-speaking folks who live around the world. As a consequence, when possible, we try to organize our events in time slots that are not too late for someone based in Europe, and not too early for someone in México. 

This collaboration between the four RLadies chapters to organize online events has been a great experience. On the one hand, it allowed us to connect with a broader community in a new way, so we plan to keep organizing online events even when meetings in person are back. On the other hand, we as organizers became closer. At least for me, having the opportunity to share time with such an awesome group of people has been one of the things that motivates me to keep going during these difficult times.  

RC: Can you tell us about one recent presentation or speaker that was especially interesting and what was the topic and why was it so interesting? 

A couple of months ago we ran an event about how we have been using R during the pandemic, and what new things we have discovered and learned. In that context, Alejandra Silva Tapia, the organizer of RLadies Talca, gave a talk about sonification techniques with R. In her presentation she not only showed some explorations she did with meteorological data using packages like {tuneR} and {sonify}, but also the teaching potential of these techniques. She shared her experience sonifying plots in order to explain statistical distributions to blind students. With just a couple of lines of code, she gave attendees a tool to make their learning materials more accessible. 

It was so interesting that in our internal Slack we added a new channel to share our sonification experiments and new ideas on the subject.

RC: What trends do you see in R language affecting your organization over the next year?

There are currently more than 18000  packages on CRAN, and many of them are very field-specific, so it is very difficult to keep up to date with all the new possibilities that they offer. Therefore, it is very challenging to decide what new workshop to offer; what new package to share with our community. Should we run a workshop about something broad and general that might benefit anyone? Or do we target a specific audience that will benefit from learning about new packages or techniques for their field? 

To face that challenge we have been trying to do a mix of both. We have organized workshops focused on general tasks, such as cleaning data, modeling, visualizing, etc., and also subject specific events. Organizing both types of workshops (general and specific) has been our way to attend the needs of a very diverse audience. 

Another trend we are very happy to see is the discussion around diversity, inclusion, accessibility and algorithmic bias in Data Science. We are currently running a book club based on the book Data Feminism to discuss some of these topics. Discussing the social and ethical issues involved in coding and data science is something that interests all of us. And a safe space like RLadies is a great place for starting that conversation. 

In regions like Latin America the decision about what workshops to offer is not trivial. Here, being able to understand English is, in most of the cases, a sign of privilege: you went to a private school, you had the opportunity to study abroad, you work in international projects,  people in your family speak English, etc. And that is not very common. This means that RLadies chapters and R Users groups are sometimes the only place for many non-english speakers to learn about the new developments in R and Data Science. So when deciding which workshop to run, we have this in mind. We see this as part of our mission. 

This was also the reason why the Latin American R community translated, as a joint effort, the book R for Data Science into Spanish. We saw the need (and impact) of having this kind of resource available for everyone who has interest in learning R. 

Do you know of any data journalism efforts by your members?  If not, are there particular data journalism projects that you’ve seen in the last year that you feel had a positive impact on society?

We have had journalists attending R Ladies events in the past, and we are very happy to see that some of them, who were also professors, started promoting the inclusion of R as part of their undergraduate curriculum. 

Regarding the second question, there are three data journalism projects that have had a positive impact on society here in Chile, from my point of view. The first one is La bot, a Telegram and Facebook Messenger bot that sends you short and precise data based analysis of current issues. This is a women-led project that has received support and funding from the International Women’s Media Foundation and the Open Society Foundation. The team has made an amazing job showing new ways in which journalists can seek and connect with different audiences. And by always discussing current issues supported by data, La bot has also been a great way to fight the spread of fake news.  

The second one are the reports Alejandra Matus did in the first months of the pandemic. She explored the death rates in Chile for the past ten years and exposed that the government was underreporting COVID deaths. She revealed that there was an excess death rate for March 2020 that the authorities were not explaining. They were only reporting PCR-positive patients who died at hospitals, not people who were dying at their homes or elderly nursing homes. Her work had a great impact. Not only because we started demanding more transparency from the government regarding COVID data, but also because many people began to realize the seriousness of the pandemic. 

The third one is called Plataforma Telar. Chile is currently drafting a new constitution and, in this context, Plataforma Telar is using innovative methodologies to gather and analyze data related to this process (and they are using R!). What I find really interesting is that, although this is an interdisciplinary project based in academia, they have made alliances with networks like CNN Chile to showcase their findings to reach a broader audience. 

RC: When is your next event? Please give details!

Our next event is the sixth session of the book club about “Data Feminism”, which will be held in late November. For December we are planning a workshop about building your first R package and one about using git/github in RStudio.

We have been able to be a very active chapter, even during the pandemic, mainly because of two reasons. First, because all our activities are the joint effort of the four RLadies chapters of our country. That makes all the planning easier and keeps us motivated. Second, because we plan the workshops not only taking into account what we already know, but also what we want to learn. For example, if I want to learn about a specific package, I will volunteer to run a workshop about it in a couple of months. That way I have an incentive and a deadline to achieve that objective. Because R Ladies is a collaborative and safe space, we feel comfortable running events that are not about something that we have already mastered, but about something we are currently learning.

RC: Of the Funded Projects by the R Consortium, do you have a favorite project? Why is it your favorite?

Obviously, the R Ladies project is very dear to my heart. The support of the R Consortium has been crucial to offer current and prospective chapters the human and technological support to operate. 

I also want to mention the SatRdays project led by Steph Locke and Gergely Daroczig. They developed a starter kit, a knowledge base, and all the infrastructure you might need to run your own SatRday.

SatRdays are accessible R-focused conferences organized by local R communities, that are held on Saturdays. In 2018 RLadies Santiago and the Santiago R Users group organized one of these events, and we wouldn’t have been able to do it without all the support this project provided. That event was very important for growing our local community, so we planned another one for April 2020. But we had to suspend it due to the pandemic. We expect to be able to run it again in 2022.

RC: Of the Active Working Groups, which is your favorite?  Why is it your favorite?

I’m not sure if I have a favorite one, but I really like the idea of working groups that are focused on specific fields, like R/medicine and R/pharma. They are a great way to bring together people that are using R for similar purposes to collaborate on events and advocacy, and to make advances in different areas by promoting cooperation.

It would be great to see in the future similar working groups for other fields (e.g. R/social sciences, R/humanities, R/ecology, R/open government, etc.). 

RC: There are four projects that are R Consortium Top Level Projects. If you could add another project to this list for guaranteed funding for 3 years and a voting seat on the ISC, which project would you add?

I would love to see a project about multilingualism. Currently there are many people working toward this aim, and not only by translating learning resources, but also by developing packages that take into account that English is not the only language that exists. For example, Michael Chirico has made a package called potools which allows you to internationalize your own package by translating user-facing communications (e.g., warnings, errors, etc.) into different languages. Also, the Datasketch team (Colombia) developed a package called Shi18y, that allows you to create multilingual Shiny apps. And a group of RLadies from Brazil and other countries from Latin America are currently developing a package with datasets in Portuguese for people to use when teaching/learning R, similar to the ones that already exist for Spanish and Turkish.

All these are great efforts that are helping to make R more accessible to non-English speakers. It would be great to see a Top Level Project that promotes these kinds of initiatives. 

How do I Join?

R Consortium’s R User Group and Small Conference Support Program (RUGS) provides grants to help R groups around the world organize, share information and support each other. We have given grants over the past 4 years, encompassing over 65,000 members in 35 countries. We would like to include you! Cash grants and meetup.com accounts are awarded based on the intended use of the funds and the amount of money available to distribute.

Starting the South Coast MA UseR Group: A first year in review

By Blog

By Amanda Hart, Co-organizer of South Coast MA UseR Group

2020 was a year of firsts: first global pandemic in my lifetime, first year working remotely, first year not traveling home for the holidays, and on a more positive note the first year for the South Coast MA UseR Group. Just before lockdown I had the opportunity to join an Openscapes workshop led by Julia Stewart Lowndes. I spent the week thinking about open data science and learning about tools to aid in collaboration, but I never expected to need to exercise those tools so quickly or extensively. Overnight my community changed. Offices closed, schools closed, classes turned into remote learning experiments, and my community of students and coworkers scattered. This is when the South Coast MA UseR Group was born.         

Our R user group started as an excuse to stay in touch and build on the small community formed by the Openscapes workshop. I found a wonderful co-organizer, Amanda Meli, and together we started our monthly meetings. We invited anyone remotely interested in R to join our fledgling group and take the opportunity to “see” people again. Over the course of this last year, our local R user community in southern Massachusetts, USA, expanded to include students and professionals from across Massachusetts, the U.S. and the world with several guest speakers calling in from abroad (one of the benefits of remote meetings). 

In so many ways 2020 was a hot mess, but our monthly UseR meetings have been a welcome source of consistency, community, and learning throughout. We are continuing our remote meetings as we start year two, but with any luck (and a dash of science) I look forward to meeting some of my newfound community members in person in the coming months. But until that hope is realized I cannot thank enough my co-organizer, group members, guest speakers, and behind the scenes supporters for making this crazy year better.