Anantha 3.0

23
Guwahati

description

The department magazine of Department of Mathematics, IIT Guwahati. Version 3.0

Transcript of Anantha 3.0

Page 1: Anantha 3.0

GuwahatiGuwahati

Page 2: Anantha 3.0

1

contentsHOD...3

President...4The Editior...5

MATRIX...6

Department Picnic...7

Batch of 2013...10

Placements...9

Page 3: Anantha 3.0

2International Statistics Year 2013

A Welcome Note...11

INSIGHT

Ripples of life...20

Credits: Cover depicts an infinite tree symbolising the concept of infinity. Pictures of Statistical year supplied by Dr. Arabin Kr. Dey. Pictures of Kurua Island by Dr. P.A.S.S. Krishna. Magazine designed by Sourav Bikash.

TECH

COLUMN

Graph Databases....15

Natural Logarithm....17

Credit Scoring...12

Gravitational Red-Shift...18

Page 4: Anantha 3.0

3message

Dr. Bhaba Kr. Sar�aHead of the Depar��entDepar��ent of MathematicsIIT Guwahati.

It gives me immense pleasure to learn that the third edition of Anatha, the magazine of MATRIX, is about to come out. The magazine has the potentialityof being a platform for intense intellectual and recreational activities of the members of MATRIX. The pages of Anantha can reflect the creativity of our students and their experiences in academic as well as in areas including training, recreational and extra-curricular activities.

I am sure the forthcoming volume will be successful in stimulating moreinterest and interactions among the members of the association with diverse ideas in mathematics and computing.

I will be very happy to see on the pages of the magazine write ups of alumni of the departmenton their experiences in the department during their study here as well as about their present experiences. I am sure such contributions would be of help to the members.Further,I wish to see the activitiesof the MATRIX with enhanced vigour and more enthusiasm in coming days. I congratulate the editorial team of Anantha for their efforts.

Page 5: Anantha 3.0

4message

Abhinav Yadav, BTech 4th Yr.President, MATRIXIIT Guwahati.

Here at MATRIX we continue our march towards professional and intellectual development, paving the road to the future. The keywords remain the same, opportunity, challenge, innovation, support and development but the efforts have increased manifold. The highly qualified and creative professors of our department with their midas touch produce winners. We are committed towards providing the students with ample opportunities to explore different ideas and fields of Mathematics and Computing. MATRIX values the fact that the college is the most important phase of a person’s life where he/she learns to work in a team, take responsibility and make critical decisions.The past year has been an eventful one. From tough courses and labs to fun filled parties, we have seen it all.

Though not very old we have been able to establish ourselves as one of the best in the world. Credit goes to the undying spirit of the students. I would like to mention the tireless work of the previous and present executive councils of MATRIX, for bringing it where it is today. Last but not the least, I would like to thank our faculty advisor Dr. N Selvaraju and the chairman Dr. B. K. Sarma for their support and guidance which has been essential in the growth of MATRIX through the years.

Page 6: Anantha 3.0

5message

Sourav Bikash, BTech 2nd Yr.Publication Scretar�, MATRIX.IIT Guwahati.

It has been a great pleasure to successfully publish this edition of Anantha.It was heartening to see the zeal and involvement of so many students. For many of us, it would be like strolling down the memory lane. Whereas for our new members, it would be a glimpse into the future. Anantha proves a serene canvas over which we wish to share our thoughts, experiences,achievements and vignettes of our lives with each other. This year we celebrate the International Year of Statistics. You can also have a glimpse at the beautiful pictures of Kurua Island, the sight for this year’s Department Picnic. This year MATRIX has been through many ups and downs. However, at the end of the day it feels great to live through them. I would like to express my gratitude to Dr. B.K.Sarma, Dr. A.K.Dey, Dr. N.Selvaraju and the MATRIX for helping me in this endeavour. Finally, I thank all my batchmates and my Department.

Page 7: Anantha 3.0

6matrix 2012-13

MATRIX is the student body of the Mathematics Department of IIT Guwahati started in 2008. Itis actively involved in promoting the career interests of the students. Over the past few years it hasorganized seminars, workshops and lectures on topics of mathematics and computing. It not only acts as an interface between the students and the teachers, but tries to increase the involvement ofstudents in the department.It consists of an executive council headed by the Head of the Department. The current council members are:President - Abhinav YadavTreasurer - Vipin SinghalTech Secretary - Chirag MaheshwariWeb Secretary - Amit MittalExternal Affairs Secretary - Manan BhutaniSocial Secretary - Devarshi WaghelaPublication Secretary - Sourav BikashApart from promoting technical and intellectual growth of its membres, it takes interests in social activities such as organizing freshers’ interaction, department picnic, placement talks, alumni interaction etc. It also acts as a forum dicussing ideas and sharing knowledge. Contribution of the faculty and the students has made MATRIX a success and hopefully will continue to do so.

Manan Bhutani Vipin Singhal Chirag Maheshwari

Page 8: Anantha 3.0

7departmental picnic

One of the prominet social initiatives of MATRIX is the Department Picnic.

This year we went to a river island - Kurua, about 40 km upstream Brahmaputra from Guwahati.

(Photo credits: Dr P.A.S.S. Krishna)

Page 9: Anantha 3.0

8

kurua

(photo credits: Dr. P.A.S.S. Krishna)

Page 10: Anantha 3.0

9placement 2012-13

The department nurtures its students to be placed in the best companies of the world. The training given the faculty is of international standard making the students fit for a career in Investment Banking Financial and Business Analysis Market Research Data Analysis Quantitative and Financial Analysis Information Technology Research and Development.This years’ placement record has been quite promising. The following are the statistics:Facebook - 1Goldman Sachs (Strategies) - 3Goldman Sachs (Technical Analyst) - 2Microsoft - 1Directi - 1Schlumberger - 1EXL Service - 1Adobe - 1Flipkart - 1Oracle Server Technologies - 1Oracle Systems - 1FICO - 2Y Media Labs - 1Capital IQ - 1iRunway - 1Virtusa - 1Ericsson - 1IBM GBS - 1 Innovaccer Management - 1CA Technologies - 1

Total on-campus placement - 24/31 (77.41%)Highest package - 110,000 USDHighest domestic package - INR 28 LPA

Median Salary - 8.2 LPAAverage Salary - 14.6 LPA

Page 11: Anantha 3.0

10the batch 2013

Bachelor of Technology 2013

Master of Science 2013

Page 12: Anantha 3.0

10

Page 13: Anantha 3.0

11 Objective of Statistics can be projected in three dimensions; making sense of the data, to forecastand to address uncertainty. A famous statement which was popularized by Mark Twain, an American author and humorist : ``There are three kinds of lies: lies, damned lies, and statistics." A statement often used to pull the leg of Statistics lovers and here is my defence : Statistics gives some decision driven by the data and decision made by any statistical algorithm always carry some probability. It never says the event will occur, rather it always try to add an extra truth of "the approximate chance" of occuring the event. Florence Nightingale, a celebrated English social reformer and statistician : "(Statistics is) the mostimportant science in the whole world, for upon it depends the practical application of every other science and of every art: the one science essential to all political and social administration, all education...." About the growing interest for Statistics, an editorial in Science by Marie Davidian and Thomas A. Louis offers the comment : ``This field will become ever more critical as academia, businesses, and governments rely increasingly on data-driven decisions, expanding the demand fo r statistics expertise." These data-driven decisions already have started, and they will undoubtedly increase. Hindrance for statistical application particularly for the researchers is the availability of irrelavantpublic data. Hal Varian, Google’s chief economist, made a nice comment regarding this issue : ``If left shoes (the data) are really cheap and you have a monopoly in right shoes (people with analytic capability toanalyze that data, i.e., statisticians), you are in a really good position." Goal of this worldwide awareness campaign are to :(1) Increase public understanding of the power and impact of statistics on all aspects of society. (2) Nurture statistics as a profession. The year 2013 is International Statistics Year. It is start to assure the rise of statisticians. Nearly 1,700 organizations in 115 countries are combining energies to promote the International Year of Statistics, a worldwide initiative that will highlight the contributions of the statistics field to finding solutions to globalchallenges. Every moment and every snapshot of the world you explore is ultimately a number sitting between zero and one. That's amazing. ``This is your year. 2013.0123, in a different order - random, maybe.This is the year of StatisticsThis is the year of the StatisticianStand up and count, stand up and be heardRaise your voiceLiveReach out and make a difference."

Dr Arabin Kr. DeyAssistant ProfessorDepartment of MathematicsIIT Guwahati

a welcome note

Page 14: Anantha 3.0

12credit scoring

A bank is a major fnancial institution in the fnancial market. Its main source of revenueis interest on loans, paid by its borrowers. However, it does not imply that the more loansit grants, the higher its profit will be. It is because a bank is subject to credit risk inlending. Credit risk is the possibility of default by borrowers, also known as default risk. Incompetitive markets, interest rates charged by different banks are very close. Therefore, inorder to survive in the financial markets, the bank must manage its credit risk well.Credit scoring is one of the effective tools to do so. It is a quantitative and objective toolto distinguish low default risk borrowers from the high risk ones.The basic idea is to developa statistical or operational research based model connecting the borrowers' characteristics,such as gender, age, income, etc, with their default risk. The resulting models are knownas scorecards or classiffiers. Making use of the credit scores, a bank can screen out bad loanapplicants, improve pricing of default risk (i.e. charging higher interest rate from riskierborrowers, and vice versa), identify important factors associated with the default risk, andmore importantly, avoid subjective judgments in the loan approval process. In short, creditscoring is beneficial to making credit-granting decisions.

Figure 1

(the scope of machine learning and statistics)

We can define a cost matrix based on credit history and other factors and calculate the threshold valueof repayment. Taking, Pr(repayment) >5/7The given cost matrix identifies the magnitude of repaid loans with refused loans, identifying the opp--ortunity cost of foregoing good loans with the amount of profit they would bring. Using the German Data-set and applying logistic regression, we have very less numbers customers that satisfy the thres-hold of probability of repayment. However the result drastically changes if only significant independentvariables are considered. We get approximately 76% success rate. This shows the importance of choosing independent variables in the data. Professionals started looking for alternative algorithms thatcan provide better accuracy regarding classification.

Page 15: Anantha 3.0

13In fact, several machine learning techniques can be compared to check the improvement. Capturingthe dependencies among the variables through proper structural learning is an important issue. Modelperformance depends largely on input variable selection. Extensive search is needed in this direction.Credit risk models are used by fnancial companies to evaluate in advance the insolvency risk caused by credits that enter into default. With this objective a mechanism is being proposed based on the key idea of writing the default probability in terms of the conditional distribution function of the time to default. The method is based on Cox's regression model.An empirical study, based on modified real data, illustrates this method. We will use the Credit Score for the German credit data that we have calculated previously as covariates in this method.

cox’s regression methodology

Classification Models merely predict the probability of default by customers; they tell us nothing aboutthe time-to-default. Obviously, that (time-to-default) is valuable information to the lender. Thus, it is the most critical limitation of Classification Models in credit risk management.In this section, a semi-parametric approach to perform the study of PD is given. Here we use Cox's proportional hazards approach to model the conditional survival function S(t|x). The key in this methodrests on the estimation of the cumulative conditional hazard function, A(j|x), using maximum likelihood.The objective is to build a conditional model for the individual PD(t|x), which is defined in terms of A(t|x). In order to describe PD_PHM, we define the following expressions relative to Cox's Regression Theory.The estimator of the conditional hazard rate function is defined as:

Under the assumptions of Proportional Hazards Model (PHM), we can define PD as,

where,

Page 16: Anantha 3.0

14the results obtained from cox’s phm:

Default rates varying with maturity of loans for 3 different forecasting periods *

By using the German credit data, we have observed the above plots.

scope and summary:

Calculation of dafault-rate is done based on semi-parametric method. Some parametric assumption for hazard function can be explored. We have assumed all borrowers are risky, or at-risk, which maynot be the case in practice. Therefore alternative statistical model can be incorporated in the above set-up.

* The variables treated here are the following: Y = maturity or loan lifetime. Here, maturitymeans time to default (T), when time is uncensored or time to withdrawal (C = 36 months),in any other case. Time was measured in months. X = scoring (credit ratio) observed foreach debtor. Here X is an univariate endogenous measure of propensity to default. Thecloser to zero the better the credit behaviour. delta = default indicator (uncensoring indicator).This variable takes value 1 if loan defaults or 0 if not.

Amanjot SinghBTech(Mathematics & Copmuting), 4th yr.

Page 17: Anantha 3.0

23

Page 18: Anantha 3.0

15

Graph databases address one of the great macroscopic business trends of today: leveraging complex and dynamic relationships to generate insight and competitive advantage. Whether we want to understand relationships between customers, elements in a telephone or datacentre network, entertainment producers and consumers, or genes and proteins, the ability to understand and analyse vast graphs of highly-connecteddata will be a key factor in the determining which companies outperform their competitors over the comingdecade. These data storage technologies are also known as NOSQL; a cheeky acronym for Not Only SQL,or more confrontationally, No to SQL.What’s remarkable about this renaissance of graph data and graph thinking is that graph theory itself is notnew. Graph theory was pioneered by Euler in the 18th century, and has been actively researched and improved since then. However, it is only in the last few years that graph theory and graph thinking have beenapplied to information management. This increased focus on graph is driven by the massive commercialsuccesses of companies such as Facebook, Google and Twitter, all of whom have centred their business models on their own proprietary graph technologies.Historically, most enterprise-level Web apps could be run atop a relational database. The NOSQL movementhas arisen in response to the needs of an emerging class of applications that must easily store and process data which is bigger in volume, changes more rapidly, and is more structurally varied that can be dealt withby traditional RDBMS deployments.

acid v/s base:

In the relational database world, we’re all familiar with ACID transactions.- Atomic: all operations in a transaction succeed or every operation is rolled back- Consistent: on transaction completion, the database is structurally sound- Isolated: transactions do not contend with one another; they are moderated and appear to run sequentially- Durable: the results of a transaction are permanent, even in case of failures

In the NOSQL world, ACID transactions have gone out of fashion. The term BASE has arisen as a popularway of describing the properties of a more optimistic storage strategy.- Basic Availability: the store appears to work most of the time- Soft-state: stores don’t have to be write-consistent- Eventual consistency: stores exhibit consistency at some later point; e.g. lazily at read time

productive analysis:One of the most appealing aspects of the graph data model is its intuitiveness. People without any backgroundin data modelling – which is to say, relational modelling – easily grasp the graph approach. The typical “whiteboard” or “back of the napkin” representation of a data problem is a graph. More often than not suchrepresentations can be directly translated into a graph data model. One of the most important contributions ofgraphs to informatics is how they make data modelling and data interrelationships available to the layperson.

graph databases

Page 19: Anantha 3.0

16the property graph model:While there are numerous kinds of graphs, most have settled on the property graph as its graph model. The main abstractions in a property graph are nodes, relationships and properties.

- Nodes are placeholders for properties.- Relationships connect nodes; they are always connected and directed. In addition every relationship has a label and can also house properties; these add semantic clarity to connections between nodes.These simple primitives are all that is needed to create sophisticated and semantically rich models.

the cypher query language:In Neo4j the language Cypher is the primary means of accessing the database. Other graph databases haveother means of querying data. A quick view of Cypher will enable us to explore more interesting graphquery scenarios.A reasonably simple query is made up of START, MATCH and RETURN clauses.- START: specifies one or more starting points in the graph- MATCH: this is the specification by example part; we “draw” the data we are interested in, using round brackets for nodes and ‘->’ or ‘<-‘ signs for relationships- RETURN: specifies which nodes, relationships and properties in the matched data should be returned to the clientHere’s an example of a Cypher query that uses these three clauses to find plays written by Shakespeare:START bard=node:authors(‘ firstname:”William” AND lastname:”Shakespeare” ’)MATCH (bard) – [:WROTE] -> (plays)RETURN playsReasons for Choosing a Graph Database:1. “Minutes to milliseconds” performance. Using index-free adjacency, a graph database turns complex joins into fast graph traversals.2. Drastically-accelerated development cycles. The graph data model reduces the impedance mismatch that has plagued software development for decades.3. Extreme business responsiveness. The schema-free nature of a graph allows it to evolve, reducing risk and time to market.

common use cases:Some of the most common graph database use cases:1. Social networks2. Recommendation engines3. Geospatial problems; like Euler solved the Seven Bridges of Konigsberg problem4. Master data management5. Network management6. Access control and authorization

Kumar HarshaBTech (Mathematics & Computing), 2nd yr.

Page 20: Anantha 3.0

17natural logarithms

In a seminar talk delivered on 29-03-2013 in Department of Mathematics, Prof. S. Pattanayakhad asked the following question:How to calculate natural logarithms without using log table and what is the idea behind it?I have tried to give an answer to this question.The definition of natural logarithms is given by:

Take, x=1+t. Then,

or,

now using the geometric expansion of 1/1+t , we get:

Using this equation, we can evaluate the aaproximate value of ln b numerically, by substituting b-1 for b. This is assuming the fact that the value of the last integral is negligible.

Rohit Madhukar PatneMSc. (Mathematics & Computing), 2nd Yr.

Page 21: Anantha 3.0

18space-time curvature distortion

The theory of general relativity predicts that gravity distorts the geometry of space-time. One of the waysto experimentally verify the presence of space-time curvature is to observe stars near the limb of the Sunduring a total solar eclipse. During such a solar eclipse, stars can be seen in the sky due to darkness thatensues. It has been verified that the known position of stars changes slightly when these stars are foundnear the limb of the Sun. Figure 1 shows the path of the photons coming from the star that is behind thesun and so should not be observable. The trajectory of the photons is bent due to the curvature of space time near the solar surface caused by the gravity of the Sun. This beam of light can be observed on the Earth during a total solar eclipse and its apparent position is slightly shifted. The measurement of such shifts confirms the presence of space time curvature by the general theory of relativity.

There exists other observational proof of space time curvature. For example, a distant quasar that is found behind another less distant galaxy is seen as four distinct images in the astronomical object called the Einstein’s cross (see figure 2). The central galaxy acts as what is called a gravitational lens by bending the light coming from the distant quasar. If for instance, both the lensing and the distant galaxy were spherical and perfectly aligned, the image of the quasar would be a circular disk seen around the central galaxy.

Figure 1

Page 22: Anantha 3.0

19

Some galaxies are lensed by other galaxiesfor which their position is slightly offset from the line of sight. Thus the curvature ofspace time caused by astronomical objectssuch as galaxies and black holes can thereforedistort astronomical observations.

Sourav BikashBTech(Mathematics & Computing), 2nd Yr.

Figure 2

References:1. An Introduction to Stellar Astrophysics, Francis LeBlanc, Wiley2. The Origin of Stars , M.D.Smith, World scientific

Page 23: Anantha 3.0

20ripples of life

Is life's journey like a placid pond which is soothing, comforting andfulfilling these days ? Small tensions, career graphs, life's ups anddowns, obsessive materialistic rat race have a rippling effect on yourminds.

Nobody knows what is in store for us in our future. Surprisingly theunknown future haunts us. But our modesty and moderate lifestyle cancertainly ease out our tensions, worries and anxieties. Enjoying simplejoys gifted to us by nature, giving back to the world more than what onehas received, socializing, sharing thoughts and time with friends,following basic courtesies and virtues certainly have a healing andtherapeutic effect on our lives.

To build an aura of positivity around us is in our hands. loving,nurturing and preserving the bounties of nature and various facets oflife, is the best way to positive living. Society is a reflection of ourethics and moral values. If we are clean and upright so shall it slowlyreflect around us. It needs perseverance, determination and a gritty willto constantly motivate oneself and take along others who touch our lives.Good virtues do not come naturally but have to be constantly practiced. Itis like scaling up a mountain step by step.

Breaking barriers of mistrust and cynicism and going back to the sereneinnocence of a child is the need of the hour. Let us listen to ourbeckoning soul to bask in the sunlight of a happy, simple, and contentedlife for a certainly "better" tomorrow.

Devarshi WaghelaBTech (Mathematics & Computing), 2nd Yr.