๋ณธ๋ฌธ ๋ฐ”๋กœ๊ฐ€๊ธฐ

๐Ÿ‘ฉ๐Ÿป‍๐Ÿ’ป DataScientist for Python/DataCamp

Introduction to statistics in python 2 - mean, median, mode

- mean ํ‰๊ท 

๊ฐ’๋“ค์„ ๋‹ค ๋”ํ•ด์„œ ์ด ๊ฐฏ์ˆ˜๋กœ ๋‚˜๋ˆˆ ๊ฐ’

- median ์ค‘์•™๊ฐ’

iloc๋กœ 83๊ฐœ์˜ ๋ฐ์ดํ„ฐ ์ค‘ ๊ฐ€์žฅ ์ค‘์•™ ๊ฐ’์ธ 41์„ ์ฐ์–ด์ฃผ๋ฉด 42๋ฒˆ์งธ ๊ฐ’์ด 10.1 ์ž„์„ ์•Œ ์ˆ˜ ์žˆ๋‹ค.

๋‹ค๋ฅธ ๋ฐฉ๋ฒ•์œผ๋กœ ์ค‘์•™๊ฐ’์„ ํ™•์ธํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” .median ์„ ์ฐ์–ด์ฃผ๋ฉด ์ค‘์•™์— ์žˆ๋Š” ๊ฐ’์„ ์ถœ๋ ฅํ•ด์ค€๋‹ค.

- mode ์ตœ๋นˆ๊ฐ’

์ตœ๋นˆ๊ฐ’์„ ํ™•์ธํ•˜๊ธฐ ์œ„ํ•ด์„œ๋Š” value_count๋ฅผ ๋‚ด๋ฆผ์ฐจ์ˆœ์œผ๋กœ ์ •๋ฆฌํ•˜์—ฌ ๊ฐ’๋“ค์˜ count๋ฅผ ํ†ตํ•ด ์ตœ๋นˆ๊ฐ’์„ ์•Œ ์ˆ˜ ์žˆ๋‹ค.

์•„๋‹ˆ๋ฉด import statistics๋ฅผ ํ•ด์ฃผ๋ฉด statistics.mode๋กœ๋„ ์ตœ๋นˆ๊ฐ’์„ ํ™•์ธํ•  ์ˆ˜ ์žˆ๋‹ค.

 

- skew data

๋ฐ์ดํ„ฐ๊ฐ€ ์ •๊ทœ๋ถ„ํฌ๋ฅผ ๋”ฐ๋ฅด๋Š” ๊ฒฝ์šฐ์—” mean ๊ฐ’๊ณผ median ๊ฐ’์ด ๋ณ„๋กœ ์ฐจ์ด๊ฐ€ ์—†์ง€๋งŒ

๋ฐ์ดํ„ฐ๊ฐ€ ์™ผ์ชฝ์ด๋‚˜ ์˜ค๋ฅธ์ชฝ์œผ๋กœ ์น˜์šฐ์ณ์ง„ ๊ฒฝ์šฐ์—๋Š” mean๊ฐ’์ด skew ๋ฐฉํ–ฅ์œผ๋กœ ๋‹น๊ฒจ์ง€๊ธฐ ๋•Œ๋ฌธ์— median ๊ฐ’๊ณผ ์ฐจ์ด๋ฅผ ๋ณด์ธ๋‹ค.

ํ‰๊ท ์€ outlier ์˜ ๊ฐ’์˜ ์˜ํ–ฅ์„ ๋งŽ์ด ๋ฐ›๊ธฐ ๋•Œ๋ฌธ์— ์ด๋ ‡๊ฒŒ ๋ณด์ด๋Š” ๊ฒฝ์šฐ์—๋Š” median ๊ฐ’์„ ์‚ฌ์šฉํ•˜๋Š” ํŽธ์ด ์ข‹๋‹ค.

- ์—ฐ์Šต

- ์Šฌ๋ผ์ด๋“œ ์ถœ์ฒ˜  Datacamp - Datascientist with python

https://campus.datacamp.com/courses/introduction-to-statistics-in-python/summary-statistics-1?ex=1