Statistics and Data Analysis (in Collab) final exam notes

Statistics uses mathematical tools to organize and summarize data obtained from the real world and draw conclusions from a correct interpretation of these data. In the business world, statistics can help assess the attractiveness of a business opportunity, increase customer satisfaction, choose bet...

  • February 23, 2023
  • 28 pages
  • 2020/2021
  • Lecture notes
  • Giovanna Lamastra Pacheco
  • All classes
  • sampling
  • types of variables
martaescrivderomancebrin
STATS FINAL NOTES

1. Types of Variables – Python




2. Python Libraries


The most important libraries we will use during this course are:

• numpy (np): for high-level mathematical functions/numerical analysis
• scipy.stats (ss): for probability distributions
• pandas (pd): for data structuring and manipulation
• matplotlib.pyplot (plt): for plots
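
As a minimal sketch, the four libraries can be imported under the aliases listed above. The sample scores are invented for illustration and are not part of the course material.

```python
import numpy as np
import scipy.stats as ss
import pandas as pd
import matplotlib.pyplot as plt

# Hypothetical sample: five exam scores invented for illustration.
scores = np.array([6.5, 7.0, 8.5, 5.0, 9.0])

# numpy: numerical summaries of an array.
mean_score = np.mean(scores)

# scipy.stats: probability distributions, here the 97.5% quantile
# of the standard normal distribution.
z_cutoff = ss.norm.ppf(0.975)

# pandas: labeled data structures built on top of numpy arrays.
scores_series = pd.Series(scores, name="score")
```

matplotlib.pyplot is only imported here; it is used once there is something to plot.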

3. Pandas

Some useful functions to always remember:

• describe(): summary statistics for each numeric column of the dataset
• head(): return the first 5 rows of the dataset (5 is the default)
• tail(): return the last 5 rows of the dataset (5 is the default)
• dtypes: data type of each column (an attribute, so no parentheses)
• shape: number of rows and columns (also an attribute)
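
A short sketch of these calls on a small data frame. The `students` data frame and its columns (`name`, `age`, `gpa`) are hypothetical, made up for this example.

```python
import pandas as pd

# Hypothetical student data, invented for illustration.
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta", "Pablo", "Sara", "Juan"],
    "age": [29, 30, 33, 35, 31, 28],
    "gpa": [8.5, 6.2, 5.4, 7.9, 9.1, 6.0],
})

summary = students.describe()    # count, mean, std, min, quartiles, max
first_rows = students.head()     # first 5 rows by default
last_rows = students.tail()      # last 5 rows by default
column_types = students.dtypes   # attribute: one data type per column
n_rows, n_cols = students.shape  # attribute: (rows, columns)

print(summary)
print(column_types)
```

Passing a number changes how many rows are returned, for example `students.head(3)`.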

4. Manipulation with Pandas

Filter rows (slicing). Example: assume we want to select only the students whose age
is between 30 and 33, both included.
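
The original code for this example is not preserved in these notes. One possible way to write the filter, assuming a hypothetical `students` data frame with an `age` column:

```python
import pandas as pd

# Hypothetical student data, invented for illustration.
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta"],
    "age": [29, 30, 33],
    "gpa": [8.5, 6.2, 5.4],
})

# Boolean mask: each condition needs its own parentheses when combined with &.
mask = (students["age"] >= 30) & (students["age"] <= 33)
selected = students[mask]

# Equivalent filter with Series.between, which includes both endpoints by default.
selected_between = students[students["age"].between(30, 33)]

print(selected)
```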




Create a new column. Example: assume we want to add a new column recording the
gender of the students. The 3 students are, in order: male, female, male.
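
A sketch of one way to add the column, assuming a hypothetical 3-row `students` data frame. The list must have one value per row, in row order.

```python
import pandas as pd

# Hypothetical student data, invented for illustration.
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta"],
    "age": [29, 30, 33],
    "gpa": [8.5, 6.2, 5.4],
})

# Assigning to a column name that does not exist yet creates the column.
students["gender"] = ["male", "female", "male"]

print(students)
```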




We want to assign a letter grade to students, with the following rule:

• gpa >= 8: letter grade A
• 6 <= gpa < 8: letter grade B
• gpa < 6: letter grade C
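
The rule above can be sketched with a small helper applied to the `gpa` column. The helper name `letter_grade`, the new column name and the data are assumptions made for this example.

```python
import pandas as pd

# Hypothetical student data, invented for illustration.
students = pd.DataFrame({
    "name": ["Ana", "Luis", "Marta"],
    "gpa": [8.5, 6.2, 5.4],
})


def letter_grade(gpa):
    """Map a numeric gpa to a letter grade following the rule in the notes."""
    if gpa >= 8:
        return "A"
    if gpa >= 6:  # reached only when gpa < 8
        return "B"
    return "C"


# apply() calls the helper once per value in the column.
students["letter_grade"] = students["gpa"].apply(letter_grade)

print(students)
```

Because the conditions are checked in order, the boundary values fall where the rule says: a gpa of exactly 8 gets an A and a gpa of exactly 6 gets a B.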

Sort by a column. Example: assume we want to sort the data frame with respect to the
values of a given column. Here we sort the data frame by gpa, in decreasing order:
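
A sketch of the sort with `sort_values`, again on a hypothetical `students` data frame:

```python
import pandas as pd

# Hypothetical student data, invented for illustration.
students = pd.DataFrame({
    "name": ["Luis", "Ana", "Marta"],
    "gpa": [6.2, 8.5, 5.4],
})

# ascending=False gives decreasing order; the original data frame is not modified.
sorted_students = students.sort_values(by="gpa", ascending=False)

print(sorted_students)
```

The rows keep their original index labels after sorting. Chaining `.reset_index(drop=True)` renumbers them from 0 if that is preferred.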




5. Graphs

We can describe categorical variables using frequency distribution tables and graphs
such as bar charts and pie charts. Histograms play the same role for numerical variables.
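
As an illustration, a bar chart of a categorical variable can be drawn from its value counts. The data are invented, and the non-interactive `Agg` backend is chosen only so that the sketch runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; drop this line in a notebook
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical categorical variable, invented for illustration.
gender = pd.Series(["male", "female", "male", "female", "female"], name="gender")

# Frequency of each category, ordered alphabetically by category label.
counts = gender.value_counts().sort_index()

fig, ax = plt.subplots()
ax.bar(list(counts.index), list(counts.values))  # one bar per category
ax.set_xlabel("gender")
ax.set_ylabel("frequency")
ax.set_title("Bar chart of a categorical variable")
```

In an interactive session, `plt.show()` displays the figure.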



FREQUENCY DISTRIBUTION TABLE

A frequency distribution is a table used to organize data. The left column (called
classes) includes all possible responses to a variable being studied. The right column is
a list of the frequencies (number of observations of each class);

A cumulative frequency distribution contains the total number of observations whose
values are less than the upper limit for each class. It is used to determine the number of
observations that lie above (or below) a particular value;

A relative/percentage frequency distribution is obtained by dividing each frequency
by the total number of observations (n). It can be expressed as a percentage;

A relative/percentage cumulative frequency distribution is the quotient between the
cumulative frequency of a particular value and the total number of observations (n). It
can be expressed as a percentage.
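
The four tables defined above can be built in pandas as sketched below. The responses and the class order are invented for illustration. For categorical classes, the cumulative column counts the observations up to and including each class.

```python
import pandas as pd

# Hypothetical survey responses with three ordered classes.
responses = pd.Series([
    "low", "high", "medium", "low", "medium",
    "medium", "high", "low", "medium", "low",
])
classes = ["low", "medium", "high"]  # assumed ordering of the classes

# Frequency distribution: number of observations in each class.
frequency = responses.value_counts().reindex(classes)
table = pd.DataFrame({"frequency": frequency})

# Cumulative frequency: running total down the ordered classes.
table["cumulative"] = table["frequency"].cumsum()

# Relative versions: divide by the total number of observations n.
n = table["frequency"].sum()
table["relative"] = table["frequency"] / n
table["relative_cumulative"] = table["cumulative"] / n

print(table)
```

Multiplying the two relative columns by 100 expresses them as percentages.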
