Sunday, May 22, 2016

Introduction to R (2/10) - Programming Fundamentals

In the first article of the series, we've introduced R and explained why R is one the most popular language when it comes to explore and manipulate data, whether it is just to visualize them, build more advanced analytics or use machine learning algorithms. We've started to dig into the many language strengths, including the library of packages and functions, the ability to connect to many data sources, tools for statistics and computations. Obviously, R is also successful because it is free software and because it does not require advanced programming skill. You should now have been able to install R and RStudio and run a few commands.

Yes, it is true: we've said that R does not require advanced programming skills! Although that statement is true, there are some basic programming elements and methods that need to be learned by any R user. This article introduces some fundamentals of R Programming.

Saturday, May 21, 2016

Introduction to R (1/10) - An Efficient Language for Data Processing

R is a programming language built for statistical and graphics purposes. For reasons that we are going to develop in this article and the ones coming after, R has grown in popularity to become one of the highest ranked widely used software among data miners and statisticians.

The fact that R is free to use, the wealth of the R community as well as the incredible number of packages that have been developed over the years make it an essential component to anyone involved in data science, data mining or machine learning. Many data management engines, like Spark, SQL Server or Oracle now provide deep R integration so that as much as SQL, it is important learn and know a little about how to use R today, if not embracing it and figuring out what makes R most reliable options. We will just do that and we will start by presenting RStudio which is the developing environment provided with R.

Sunday, January 10, 2016

Oracle database 12c with Oracle Linux 7.x and LXC

Linux Containers (LXC) are now certified to run Oracle Database 12c, including Oracle RAC and ASM. You can check out for yourself in the Virtualization Support Matrix as well as in the Database 12.1 Release Notes. If you want to give it quick try and figure out what could be done, nothing is very complicated. Below is a quick introduction. It demonstrates how to use Oracle database 12.1 with Oracle Linux 7.x and LXC container.

Sunday, May 31, 2015

Building an Oracle Linux Amazon Web Services AMI with Packer

No need to explain why Amazon Web Services is a nice platform to run Oracle. The largest IaaS Cloud Provider is one of the few Oracle Authorized Cloud Environments. You can Bring Your Own "Oracle" Licenses (BYOL) on AWS EC2 or RDS. You can pay for what you've provisioned because it is considered as a hard partitioned Platform.

Obviously, there are some drawbacks including the one pointed by Marc Fielding on Pythian's Blog or the fact you cannot run RAC. But it is, for sure, a nice playground to run Oracle technologies, your applications on Oracle or any Oracle applications including JD Edwards, Peoplesoft, Oracle E-Business Suite and many more... In addition, Amazon RDS for Oracle offers an attractive pay-per-use and "per-hour" model for Oracle Database SE1.

On the other hand, Oracle Linux is probably the best operating system to run Oracle, on AWS EC2 like on any x86_64 Platforms: it requires one-only rpm to be ready for most Oracle software; it is free and you don't need to support all your instances the same way as described in the Service Levels section of "Oracle Linux and Oracle VM Support Policies". Last and not least, it is 100% compatible with Redhat Enterprise Linux and you can even get AMIs from Oracle on the AWS Marketplace.

This blog presents and easy and fast way to create an Oracle Linux AMI from the Open Virtual Format (OVF) with AWS VM Import/Export... It relies on Virtualbox, and Kickstart.

Monday, May 25, 2015

Customizing Redhat/Oracle Linux 6 ISOs

Performing a complete Redhat/Oracle Linux installation when booting from an ISO is very neat. This is an easy and consistent method to quickly start virtual machines with Packer for VirtualBox, VMWare or Parallels.

You can find a complete example of how to do it with Redhat/Oracle Linux 7 in "Customizing Redhat/Oracle Linux CDROMs, ISOs and USBs". However, I had to use this method with release 6 of Oracle Linux this week and, as you can expect, some of the parameters have changed. That's why I've documented how to do it also with Release 6. It should not take you more than 15 minutes to customize your own ISO file and 5 minutes to perform a new installation...

Tuesday, September 16, 2014

Puppet on Oracle Linux 7: Master and Agent Installation

Automation is changing the server management game. Not only, you can now deal with hundreds of servers from a single access point but you can also evaluate impacts, schedule changes and make sure servers stay aligned over time. This can drastically increase system and application service levels by reducing the way they differ one from the other.

When it comes to managing Oracle workloads, Puppet is one of the few automation frameworks of choice. It is easy to use, widely adopted and you can find predefined modules for Oracle on Puppet Forge. It addresses most of the challenges you will face with (1) templates and golden images that are difficult to evolve once deployed ; (2) all-or-nothing software packagers like yum that don't allow fine grained customization required by real applications and (3) custom scripts that are difficult to managed in non-standardized environments.

This article outlines the steps required to install a Puppet Master Server as well as a Puppet Agent on Oracle Linux 7.

Monday, August 11, 2014

How to Install Oracle Linux 7 with Kickstart, PXE and Dnsmasq

Assuming you've synchronized an Oracle Linux 7 repository and you publish it from an HTTP server, you are just missing dnsmasq and a few files to perform an installation of Oracle Linux 7 with Kickstart and PXE. It will not take you more than 15 minutes to configure the whole thing...

dnsmasq is the perfect tool for that. It includes a DHCP server, a basic TFTP server and it is very simple to configure.

This article shows an example of configuration. It explains the few files that are required and come from the distribution. You should be able to add or modify the other files manually to get the work done...