Data Mining Syllabus – PyMathCamp

Demand for data science talent is exploding. McKinsey estimates that by 2018, a workforce of 500,000 data scientists will be needed in the US alone. The resulting talent gap must be filled by a new generation of data scientists. The term "data scientist" is quite ambiguous. The Center for Data Science at New York University describes data science as:

the study of the generalizable extraction of knowledge from data [using] mathematics, machine learning, artificial intelligence, statistics, databases and optimization, along with a deep understanding of the craft of problem formulation to engineer effective solutions


As you can see, a data scientist is a professional with a multidisciplinary profile. The value extracted from data depends on the skills of the data scientists who process it. PyMathCamp offers these essentials. This course is your stepping stone to becoming a data scientist. Key concepts in data acquisition, preparation, exploration and visualization, along with examples of how to build interactive data science solutions, are presented using IPython notebooks.
You will learn to write Python code and apply data science techniques to many fields of interest, for example finance, robotics, marketing, gaming, computer vision, speech recognition and many more. By the end of this course, you will know how to build machine learning models and derive insights from data.

The course is organized into 11 chapters. The major components of PyMathCamp are:

1) Data management (extraction, transformation, loading, storage and cleaning)

We begin by studying data warehousing and OLAP, data cube technology and multidimensional databases (Chapters 2, 3 and 4).
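To make the idea of a data cube concrete, here is a minimal sketch in plain Python. The sales records, the dimension names (region, product, quarter) and the helper functions `clean` and `roll_up` are all invented for illustration; a real warehouse would use a database or an OLAP engine rather than in-memory lists.

```python
from collections import defaultdict

# Hypothetical raw sales records: (region, product, quarter, amount).
# Some rows are dirty on purpose: stray spaces, mixed case, a missing amount.
raw_rows = [
    ("North", "Laptop", "Q1", "1200"),
    (" north", "Phone", "Q1", "800"),
    ("South", "Laptop", "Q1", "950"),
    ("South", "phone ", "Q2", ""),       # missing amount, dropped when cleaning
    ("North", "Laptop", "Q2", "1100"),
    ("South", "Phone", "Q2", "700"),
]


def clean(rows):
    """Transform step: normalize text fields and drop rows without an amount."""
    cleaned = []
    for region, product, quarter, amount in rows:
        if not amount.strip():
            continue
        cleaned.append((region.strip().title(), product.strip().title(),
                        quarter.strip().upper(), float(amount)))
    return cleaned


def roll_up(rows, dims):
    """Sum the sales amount over the chosen dimensions (one cuboid of the cube)."""
    index = {"region": 0, "product": 1, "quarter": 2}
    totals = defaultdict(float)
    for row in rows:
        key = tuple(row[index[d]] for d in dims)
        totals[key] += row[3]
    return dict(totals)


facts = clean(raw_rows)
by_region = roll_up(facts, ["region"])
by_region_quarter = roll_up(facts, ["region", "quarter"])
grand_total = roll_up(facts, [])   # rolling up over every dimension

print(by_region)    # {('North',): 3100.0, ('South',): 1650.0}
print(grand_total)  # {(): 4750.0}
```

Each call to `roll_up` with a different list of dimensions gives one view of the same facts, which is the basic operation an OLAP tool performs at much larger scale.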

2) Data mining (machine learning technology, math and statistics)

Descriptive statistics are applied for data exploration, followed by mining frequent patterns, associations and correlations. We will also learn more about the different types of machine learning methodology through Python programming (Chapter 5).
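As a rough illustration of frequent pattern and association mining, the sketch below counts item pairs in a handful of made-up shopping baskets and computes their support and confidence. The baskets, the 0.4 support threshold and the helper names are assumptions chosen for the example, and the brute-force counting stands in for dedicated algorithms such as Apriori.

```python
from collections import Counter
from itertools import combinations
from statistics import mean, median

# Hypothetical market baskets, invented for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "eggs"},
    {"bread", "milk", "eggs"},
]

# Descriptive statistics give a first look at the data.
basket_sizes = [len(basket) for basket in transactions]
avg_size = mean(basket_sizes)
mid_size = median(basket_sizes)


def itemset_counts(baskets, size):
    """Count how many baskets contain each itemset of the given size."""
    counts = Counter()
    for basket in baskets:
        for combo in combinations(sorted(basket), size):
            counts[combo] += 1
    return counts


n = len(transactions)
single = itemset_counts(transactions, 1)
pairs = itemset_counts(transactions, 2)

# Support: the fraction of baskets that contain the pair.
min_support = 0.4
frequent_pairs = {p: c / n for p, c in pairs.items() if c / n >= min_support}


def confidence(antecedent, consequent):
    """Estimate P(consequent | antecedent) from the basket counts."""
    pair = tuple(sorted((antecedent, consequent)))
    return pairs[pair] / single[(antecedent,)]


print(frequent_pairs)
print(confidence("butter", "bread"))  # 1.0: every butter basket also has bread
print(confidence("bread", "butter"))  # 0.5
```

Note that confidence is not symmetric: "butter implies bread" holds in every basket here, while "bread implies butter" holds in only half of them.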

3) Data analysis/prescription (classification, regression, clustering, visualization)

At this stage, we are ready to dive into data modelling with different types of machine learning methods. PyMathCamp includes many different machine learning techniques to analyse and mine data, including linear regression, logistic regression, support vector machines, ensembling and clustering, among numerous others. Model construction and validation are studied. This rigorous data modelling process is further enhanced with graphical visualisation. The end result is insight for intelligent decision making (Chapters 6 and 7).
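Model construction and validation can be sketched with the simplest of the techniques listed above, linear regression. The example below fits a line by ordinary least squares on a training split and scores it on held-out points. The data (hours studied against exam score) and the 7/3 split are invented for illustration, and in practice a library such as scikit-learn would usually do this work.

```python
def fit_line(xs, ys):
    """Ordinary least squares for a single feature: returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def r_squared(xs, ys, slope, intercept):
    """Coefficient of determination of the fitted line on the given points."""
    mean_y = sum(ys) / len(ys)
    ss_res = sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Hypothetical data: hours studied vs. exam score, roughly y = 5x + 50 plus noise.
hours = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
scores = [56, 59, 66, 69, 76, 79, 86, 89, 96, 99]

# Model construction: fit on the first seven points only.
train_x, train_y = hours[:7], scores[:7]
slope, intercept = fit_line(train_x, train_y)

# Model validation: score the fitted line on the three held-out points.
test_x, test_y = hours[7:], scores[7:]
holdout_r2 = r_squared(test_x, test_y, slope, intercept)

print(f"score = {slope:.2f} * hours + {intercept:.2f}")
print(f"R^2 on held-out data: {holdout_r2:.3f}")
```

Scoring on points the model never saw is what separates validation from simply measuring how well the line fits its own training data.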

Source: Pethuru (2014)

Encapsulating data science intelligence and investing in modelling is vital for any organization to be successful.

Hence, we will use the data mining knowledge gained from the above chapters to analyse, extract and mine value from different types of data, specifically spatial and spatiotemporal, object, multimedia, text, time series and web data (Chapters 8, 9 and 10).

After a few months of learning and programming with PyMathCamp, you will end the course with an update on the latest applications and trends in data mining (Chapter 11).

In conclusion, PyMathCamp is the perfect course for students who might not have the rigorous technical and programming background required to do data science on their own.

Credit to: Joe Choong

“The future belongs to those who figure out how to collect and use data successfully.”

Muhammad Nurdin, CEO of IntelliJ.


PyMathCamp aims to produce modern innovators through data science and mathematics

Innovative thinking and the necessary skill set are crucial to solving real-world problems. As we approach the future, problems will become more complex. Malaysia is in dire need of modern innovators to develop state-of-the-art solutions to them. And to develop solutions, innovative thinking alone is not enough.

With a shortage of data science and mathematics talent, Malaysia will have a hard time finding local intellectual resources to solve local problems.

Yes, it is true that Malaysia can outsource to foreign experts, but it is not right to depend on them all the time. Even with that dependency, the supply is still insufficient. First, technology transfer can be very expensive; second, foreign workers need time to adapt to local structures before developing suitable solutions. The more time taken, the more money spent.

Malaysia is lacking innovators.

“Malaysia may not have enough engineers, architects, and other professionals, to achieve Vision 2020 based on the low level of interest by our students in science, technology, engineering, and mathematics (STEM). If the situation goes on, Malaysia may have to depend on foreign workers to attain developed status, warn expert.” Star Sunday.

Wawasan 2020 is getting nearer, yet we are still unable to show that we can ‘supply’ the vision.

Here we are, wanting to provide high-impact education focused on data science and mathematics to ALL Malaysians for FREE, so that the whole nation can change millions of lives for the better.

Introducing PyMathCamp.

PyMathCamp will be an online learning platform that teaches data science and mathematics using programming languages such as Python, C++ or R, to prepare future action-oriented Malaysian innovators to solve problems.

The online learning platform will help them learn how to code and further their careers in science, technology, engineering and mathematics (STEM). How?

How can the subjects of data science and mathematics create innovators?

Data science and mathematics are not “subjects in the class, stay in the class”. They are basic necessities for all kinds of fields: health, agriculture, finance, social sciences, maritime sciences, planetary sciences, meteorology, geography and many more. You name it. STEM is WIDE.

Data science, simply put, is the study of how to gather interesting data. How interesting the data is depends on the person searching for it. Data is an ocean. Whatever subject someone wants to look into, they must learn the science of pulling it from the ocean (of data), cleaning it, grooming it and presenting it informatively.

Mathematics, on the other hand, is what makes life measurable, down to basic things like genomics. Mathematics demands wisdom, judgment and maturity. We can make errors on the way to a solution; we can alter our methods or start all over. In life, reality mostly does not allow us to redo anything, but under ‘measurable conditions’ we are allowed to attempt to change things.

By defining their importance in state-of-the-art programming, we can see how both subjects are keys to economic prosperity. Without the talents above, we will have difficulty obtaining interesting parameters. To obtain them, data science and mathematics must be learnt.

Modern students of PyMathCamp should expect the following:

Students shall be able to create emphatic solutions. They shall be able to build advanced innovations through data science and mathematics and deliver curative value to others.

A variety of topics such as data exploration, visualization, feature engineering, predictive analytics, predictive modeling, clustering, big data pipelines, metrics and many more should be expected.

All trainers and mentors are expert, highly trained and experienced Malaysians. They specialize in data science, computer vision, big data, machine learning, artificial intelligence and more.

Students are also expected to find their own solutions by leveraging our programming community portal and discussion group (chit chat). For open source development, PyMathCamp will be integrated with GitHub.

We have evidence-based methods to improve every user's learning curve all the way to the finish line.

Note that PyMathCamp will be committed only to two specific fields: data science and mathematics.

There will be no age limit.

PyMathCamp will focus on Python, C++ or R because they are beginner-friendly (easy to use and understand), well supported for mathematics and the mother tongue of artificial intelligence. These are truly in-demand skills.

And it is free. Yup. No charges.

Carpe diem.

Seize the day.

We want to build a smart society to build smart structures.

We want to produce an intelligent society. Malaysia needs a smart society to help the nation grow together toward Wawasan 2020 and beyond.

Beyond filling job vacancies, we aim for students to be able to invent advanced solutions and create intelligent startups to solve society's problems. This is actually our deepest aim. We want students to be modern innovators.

Simply put, PyMathCamp is preparing Malaysians for the amazing (automated) future.

Join PyMathCamp.

IntelliJ is a deeply value-oriented company.

We want to educate Malaysians and bring the Malaysian mind to an advanced level, starting small and FOR FREE, which is essential to turning Malaysia into an economically prosperous place.

We want to produce marketable Malaysians in this self-serving economy, with high-impact education as the first line of defense.

We pray that every mission of ours enriches all lives.

“The future belongs to those who figure out how to collect and use data successfully.”

Muhammad Nurdin, CEO of IntelliJ.