texto - pandas python tutorial español pdf

¿Cómo envío las listas como una tabla en el cuaderno Jupyter? (10)

Acabo de descubrir que tabulate tiene una opción HTML y es bastante simple de usar.
Muy similar a la respuesta de Wayne Werner:

from IPython.display import HTML, display import tabulate table = [["Sun",696000,1989100000], ["Earth",6371,5973.6], ["Moon",1737,73.5], ["Mars",3390,641.85]] display(HTML(tabulate.tabulate(table, tablefmt=''html'')))

Todavía estoy buscando algo simple de usar para crear diseños de tabla más complejos, como la sintaxis de látex y el formato para fusionar celdas y hacer sustitución de variables en un cuaderno:
Permitir referencias a variables de Python en las celdas Markdown # 2958

Sé que he visto algún ejemplo en alguna parte antes, pero por mi vida no puedo encontrarlo cuando busco en Google.

Tengo algunas filas de datos:

data = [[1,2,3], [4,5,6], [7,8,9], ]

Y quiero generar estos datos en una tabla, por ejemplo

+---+---+---+ | 1 | 2 | 3 | +---+---+---+ | 4 | 5 | 6 | +---+---+---+ | 7 | 8 | 9 | +---+---+---+

Obviamente podría usar una biblioteca como prettytable o descargar pandas o algo así, pero estoy muy desinteresado en hacerlo.

Solo quiero generar mis filas como tablas en mi celda del cuaderno Jupyter. ¿Cómo hago esto?

Finalmente volví a encontrar la documentación de jupyter / IPython que estaba buscando.

Necesitaba esto:

from IPython.display import HTML, display data = [[1,2,3], [4,5,6], [7,8,9], ] display(HTML( ''<table><tr>{}</tr></table>''.format( ''</tr><tr>''.join( ''<td>{}</td>''.format(''</td><td>''.join(str(_) for _ in row)) for row in data) ) ))

(Puede que haya comprendido ligeramente las comprensiones, pero display(HTML(''some html here'')) es lo que necesitábamos)

Hay un buen truco: envolver los datos con pandas DataFrame.

import pandas as pd data = [[1, 2], [3, 4]] pd.DataFrame(data, columns=["Foo", "Bar"])

Muestra datos como:

| Foo | Bar | 0 | 1 | 2 | 1 | 3 | 4 |

Ok, esto fue un poco más difícil de lo que pensé:

def print_matrix(list_of_list): number_width = len(str(max([max(i) for i in list_of_list]))) cols = max(map(len, list_of_list)) output = ''+''+(''-''*(number_width+2)+''+'')*cols + ''/n'' for row in list_of_list: for column in row: output += ''|'' + '' {:^{width}d} ''.format(column, width = number_width) output+=''|/n+''+(''-''*(number_width+2)+''+'')*cols + ''/n'' return output

Esto debería funcionar para un número variable de filas, columnas y número de dígitos (para números)

data = [[1,2,30], [4,23125,6], [7,8,999], ] print print_matrix(data) >>>>+-------+-------+-------+ | 1 | 2 | 30 | +-------+-------+-------+ | 4 | 23125 | 6 | +-------+-------+-------+ | 7 | 8 | 999 | +-------+-------+-------+

Podría intentar usar la siguiente función

def tableIt(data): for lin in data: print("+---"*len(lin)+"+") for inlin in lin: print("|",str(inlin),"", end="") print("|") print("+---"*len(lin)+"+") data = [[1,2,3,2,3],[1,2,3,2,3],[1,2,3,2,3],[1,2,3,2,3]] tableIt(data)

Quiero generar una tabla donde cada columna tenga el ancho más pequeño posible, donde las columnas se rellenen con espacios en blanco (pero esto se puede cambiar) y las filas se separen con nuevas líneas (pero esto se puede cambiar) y donde cada elemento se formatee con str (pero...).

def ftable(tbl, pad='' '', sep=''/n'', normalize=str): # normalize the content to the most useful data type strtbl = [[normalize(it) for it in row] for row in tbl] # next, for each column we compute the maximum width needed w = [0 for _ in tbl[0]] for row in strtbl: for ncol, it in enumerate(row): w[ncol] = max(w[ncol], len(it)) # a string is built iterating on the rows and the items of `strtbl`: # items are prepended white space to an uniform column width # formatted items are `join`ed using `pad` (by default " ") # eventually we join the rows using newlines and return return sep.join(pad.join('' ''*(wid-len(it))+it for wid, it in zip(w, row)) for row in strtbl)

La firma de la función, ftable(tbl, pad='' '', sep=''/n'', normalize=str) , con sus argumentos predeterminados, tiene por objeto proporcionar la máxima flexibilidad.

Puedes personalizar

la almohadilla de la columna ding,
el separador de fila (por ejemplo, pad=''&'', sep=''/////n'' para tener la mayor parte de una tabla LaTeX)
la función que se utilizará para normalizar la entrada a un formato de cadena común --- por defecto, para la máxima generalidad es str pero si sabe que todos sus datos son lambda item: "%.4f"%item coma flotante lambda item: "%.4f"%item podría ser una elección razonable, etc.

Pruebas superficiales:

Necesito algunos datos de prueba, posiblemente con columnas de diferente ancho para que el algoritmo sea un poco más sofisticado (pero solo un poco;)

In [1]: from random import randrange In [2]: table = [[randrange(10**randrange(10)) for i in range(5)] for j in range(3)] In [3]: table Out[3]: [[974413992, 510, 0, 3114, 1], [863242961, 0, 94924, 782, 34], [1060993, 62, 26076, 75832, 833174]] In [4]: print(ftable(table)) 974413992 510 0 3114 1 863242961 0 94924 782 34 1060993 62 26076 75832 833174 In [5]: print(ftable(table, pad=''|'')) 974413992|510| 0| 3114| 1 863242961| 0|94924| 782| 34 1060993| 62|26076|75832|833174

Recientemente usé prettytable para renderizar una buena tabla ASCII. Es similar a la salida de CLI de postgres.

import pandas as pd from prettytable import PrettyTable data = [[1,2,3],[4,5,6],[7,8,9]] df = pd.DataFrame(data, columns=[''one'', ''two'', ''three'']) def generate_ascii_table(df): x = PrettyTable() x.field_names = df.columns.tolist() for row in df.values: x.add_row(row) print(x) return x generate_ascii_table(df)

Salida:

+-----+-----+-------+ | one | two | three | +-----+-----+-------+ | 1 | 2 | 3 | | 4 | 5 | 6 | | 7 | 8 | 9 | +-----+-----+-------+

Un conjunto de funciones de uso general para representar cualquier estructura de datos de Python (dictados y listas anidadas juntas) como HTML.

from IPython.display import HTML, display def _render_list_html(l): o = [] for e in l: o.append(''<li>%s</li>'' % _render_as_html(e)) return ''<ol>%s</ol>'' % ''''.join(o) def _render_dict_html(d): o = [] for k, v in d.items(): o.append(''<tr><td>%s</td><td>%s</td></tr>'' % (str(k), _render_as_html(v))) return ''<table>%s</table>'' % ''''.join(o) def _render_as_html(e): o = [] if isinstance(e, list): o.append(_render_list_html(e)) elif isinstance(e, dict): o.append(_render_dict_html(e)) else: o.append(str(e)) return ''<html><body>%s</body></html>'' % ''''.join(o) def render_as_html(e): display(HTML(_render_as_html(e)))

Yo solia tener el mismo problema. No pude encontrar nada que me ayudara, así que terminé haciendo la clase PrintTable --code a continuación. También hay una salida. El uso es simple:

ptobj = PrintTable(yourdata, column_captions, column_widths, text_aligns) ptobj.print()

o en una línea:

PrintTable(yourdata, column_captions, column_widths, text_aligns).print()

Salida:

------------- Name ------------------- Very long name 0 Very long name 1 Very long name 2 Very long name 3 Very long name 4 Very long name 5 Very long name 6 Very long name 7 Very long name 8 Very long name 9 Very long name 10 Very long name 11 Very long name 12 Very long name 13 Very long name 14 Very long name 15 Very long name 16 Very long name 17 Very long name 18 Very long name 19 -------------------

PrintTable ------------------------------------------------------------------------------------------------ | Column 1 | Column 2 | Column 3 | Column 4 | Column 5 ------------------------------------------------------------------------------------------ | 0 | 0 | 0 | 0 | 0 | 1 | 2 | 3 | 4 | 5 | 2 | 4 | 6 | 8 | 10 | 3 | 6 | 9 | 12 | 15 | 4 | 8 | 12 | 16 | 20 | 5 | 10 | 15 | 20 | 25 | 6 | 12 | 18 | 24 | 30 | 7 | 14 | 21 | 28 | 35 | 8 | 16 | 24 | 32 | 40 | 9 | 18 | 27 | 36 | 45 | 10 | 20 | 30 | 40 | 50 | 11 | 22 | 33 | 44 | 55 | 12 | 24 | 36 | 48 | 60 | 13 | 26 | 39 | 52 | 65 | 14 | 28 | 42 | 56 | 70 | 15 | 30 | 45 | 60 | 75 | 16 | 32 | 48 | 64 | 80 | 17 | 34 | 51 | 68 | 85 | 18 | 36 | 54 | 72 | 90 | 19 | 38 | 57 | 76 | 95 ------------------------------------------------------------------------------------------ El código para la clase

# -*- coding: utf-8 -*- # Class class PrintTable: def __init__(self, values, captions, widths, aligns): if not all([len(values[0]) == len(x) for x in [captions, widths, aligns]]): raise Exception() self._tablewidth = sum(widths) + 3*(len(captions)-1) + 4 self._values = values self._captions = captions self._widths = widths self._aligns = aligns def print(self): self._printTable() def _printTable(self): formattext_head = "" formattext_cell = "" for i,v in enumerate(self._widths): formattext_head += "{" + str(i) + ":<" + str(v) + "} | " formattext_cell += "{" + str(i) + ":" + self._aligns[i] + str(v) + "} | " formattext_head = formattext_head[:-3] formattext_head = " " + formattext_head.strip() + " " formattext_cell = formattext_cell[:-3] formattext_cell = " " + formattext_cell.strip() + " " print("-"*self._tablewidth) print(formattext_head.format(*self._captions)) print("-"*self._tablewidth) for w in self._values: print(formattext_cell.format(*w)) print("-"*self._tablewidth)

Demostración

# Demonstration headername = ["Column {}".format(x) for x in range(6)] headername[0] = "Name" data = [["Very long name {}".format(x), x, x*2, x*3, x*4, x*5] for x in range(20)] PrintTable(data, / headername, / [70, 10, 10, 10, 10, 10], / ["<",">",">",">",">",">"]).print()

tabletext encaja bien

import tabletext data = [[1,2,30], [4,23125,6], [7,8,999], ] print tabletext.to_text(data)

resultado:

┌───┬───────┬─────┐ │ 1 │ 2 │ 30 │ ├───┼───────┼─────┤ │ 4 │ 23125 │ 6 │ ├───┼───────┼─────┤ │ 7 │ 8 │ 999 │ └───┴───────┴─────┘