데이터분석/Python

Build Unstack Matrix Practice

버섯도리 2022. 5. 9. 11:24

1000i.csv
0.02MB
movie_rating.csv
0.00MB

import numpy as np

import pandas as pd

 

import os

os.chdir("D:/11_Study/01_Python/Spyder/AI-python/lab_assignments/lab_3")

os.getcwd()

 

 

def get_rating_matrix(filename, dtype=np.float32):

    df_data = pd.read_csv(filename)

    return df_data.groupby(['source','target'])['rating'].sum().unstack().fillna(0).values.astype(dtype)

 

get_rating_matrix("movie_rating.csv")

 

 

def get_frequent_matrix(filename, dtype=np.float32):

    df_data = pd.read_csv(filename)

    df_data['rating'] = 1

    return df_data.groupby(['source','target'])['rating'].sum().unstack().fillna(0).values.astype(dtype)

 

get_frequent_matrix("1000i.csv")

 
 
 
 
 
참조 : 부스트코스(http://www.boostcourse.org) -> 머신러닝을 위한 파이썬