본문 바로가기
데이터분석/Python

Build Unstack Matrix Practice

by 버섯도리 2022. 5. 9.

1000i.csv
0.02MB
movie_rating.csv
0.00MB

import numpy as np

import pandas as pd

 

import os

os.chdir("D:/11_Study/01_Python/Spyder/AI-python/lab_assignments/lab_3")

os.getcwd()

 

 

def get_rating_matrix(filename, dtype=np.float32):

    df_data = pd.read_csv(filename)

    return df_data.groupby(['source','target'])['rating'].sum().unstack().fillna(0).values.astype(dtype)

 

get_rating_matrix("movie_rating.csv")

 

 

def get_frequent_matrix(filename, dtype=np.float32):

    df_data = pd.read_csv(filename)

    df_data['rating'] = 1

    return df_data.groupby(['source','target'])['rating'].sum().unstack().fillna(0).values.astype(dtype)

 

get_frequent_matrix("1000i.csv")

 
 
 
 
 
참조 : 부스트코스(http://www.boostcourse.org) -> 머신러닝을 위한 파이썬

'데이터분석 > Python' 카테고리의 다른 글

Linear Regression - Normal Equation  (0) 2022.05.09
Data Handling Case Study  (0) 2022.05.09
NumPy Practice  (0) 2022.05.09
Data Cleansing Example  (0) 2022.05.09
MatPlotLib Example  (0) 2022.05.09