VisiData v0.41

A curses interface for exploring and arranging tabular data

Usable via any remote shell which has Python3 installed.

Features

  • browse first rows of huge csv/tsv/xlsx files immediately
  • F1 for command help sheet
  • open .csv, .tsv, .json, .hdf5, .xlsx
  • Ctrl-S to save .csv, .tsv
  • hjkl cursor movement, top/middle/bottom scroll to position screen cursor
  • [/] sort asc/desc by one column
  • edit cell contents
  • search/select/unselect by regex in column
  • Frequency table for current column with histogram
  • inner/outer/full/diff joins on any number of sheets, matching designated key columns
  • add new column by Python expression
  • Ctrl-O to eval an expression and browse the result as a python object
  • watch long-running sheets load asynchronously
  • : split column (on any sheet)
  • + join selected columns on Columns sheet
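
As a rough illustration of "add new column by Python expression", the sketch below evaluates an expression once per row, with the row's column names available as variables. This is not VisiData's code: `add_expr_column`, the dict-per-row layout, and the choice to store `None` when an expression fails are all assumptions made for the example.

```python
# Hypothetical sketch; VisiData's real implementation may differ entirely.
def add_expr_column(rows, expr, name):
    """Append a computed column `name` to every row (a dict: column -> value)."""
    code = compile(expr, "<expr>", "eval")  # compile once, evaluate per row
    for row in rows:
        try:
            # The row's columns are exposed to the expression as local names.
            row[name] = eval(code, {}, dict(row))
        except Exception:
            row[name] = None  # assumption: a failing row gets an empty cell
    return rows


rows = [{"qty": 2, "price": 1.5}, {"qty": 3, "price": 4.0}]
add_expr_column(rows, "qty * price", "total")
print([row["total"] for row in rows])
```

Because the expression goes through `eval`, it should only ever come from the person at the keyboard, never from the data being browsed.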

Metasheets

  • Sheets metasheet to manage/navigate multiple sheets
  • Columns metasheet
  • Options sheet to change the style or behavior
  • ^E for the Error metasheet
  • ^T for the Threads metasheet

Columns

On the Columns sheet, these commands apply to rows (the columns of the source sheet) instead of the columns of the Columns sheet itself:

  • - hides column (sets width to 0)
  • _ maximizes column width to fit longest value
  • ! marks column as a key column (pins to the left and matches on sheet joins)

Column typing

  • columns start out untyped (unless the source data is typed)
    • # sets column type to int
    • $ sets column type to str
    • % sets column type to float
    • @ sets column type to date
    • ~ autodetects column type
  • all values are stored in their original format, and only converted on demand and as needed.
  • values that can't be properly converted are flagged with ~ on the display
  • for commands like sort, which require a correctly typed value, a value that can't be converted is treated as the default value for that type (e.g. 0)
  • cell edits are rejected if they don't convert to the column type
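
The typing rules above can be sketched as follows: raw values are kept as-is, conversion happens only when a typed value is requested, and a failed conversion falls back to the type's default. `TypedColumn` and its methods are hypothetical names, and placing the `~` flag in front of the raw value is an assumption about the display, not something the list above specifies.

```python
# Hypothetical sketch of on-demand typing; not VisiData's actual classes.
class TypedColumn:
    def __init__(self, name, type_=None):
        self.name = name
        self.type = type_  # None means the column is untyped

    def typed_value(self, raw):
        """Return (value, ok); on failure, value is the type's default."""
        if self.type is None:
            return raw, True
        try:
            return self.type(raw), True
        except (TypeError, ValueError):
            # int() is 0, float() is 0.0, str() is ''
            return self.type(), False

    def display(self, raw):
        """Show the original text, flagged with ~ if it does not convert."""
        _, ok = self.typed_value(raw)
        return str(raw) if ok else "~" + str(raw)

    def accepts_edit(self, new_raw):
        """An edit is accepted only if it converts to the column type."""
        return self.typed_value(new_raw)[1]


col = TypedColumn("count", int)
raws = ["10", "9", "n/a", "2"]

# Sorting uses typed values, so "n/a" sorts as 0 and "10" sorts after "9".
by_value = sorted(raws, key=lambda raw: col.typed_value(raw)[0])
print(by_value)
print([col.display(raw) for raw in raws])
```

The stored list `raws` is never modified, which mirrors the point that values stay in their original format. The sketch covers only types with a no-argument default such as `int`, `float`, and `str`; a date type would need its own fallback.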

Installation

    $ pip3 install visidata

Dependencies

  • Python 3.3
  • openpyxl (if opening .xlsx files)
  • h5py and numpy (if opening .hdf5 files)
  • google-api-python-client (if opening Google Sheets; must also set up OAuth credentials)
  • dateutil.parser (if converting string column to datetime)

Remember to install the Python 3 versions of these modules, e.g. with pip3.

Usage

    $ vd [-r/--readonly] [<input> ...]

Inputs may be paths or URLs. If no inputs are given, starts exploring the current directory. Unknown filetypes are by default viewed with a text browser.
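
One way to picture how inputs are handled is the sketch below: no inputs means the current directory, the file extension selects the loader, and anything unrecognized falls back to the text browser. `classify_input` and `inputs_or_default` are hypothetical helpers, the extension set is taken from the feature list, and the assumption that URLs are recognized by an `http`/`https` scheme is mine.

```python
import os
from urllib.parse import urlparse

# Extensions named in the feature list; everything else is treated as text.
KNOWN_EXTENSIONS = {".csv", ".tsv", ".json", ".hdf5", ".xlsx"}


def inputs_or_default(args):
    """With no inputs, explore the current directory."""
    return list(args) or ["."]


def classify_input(arg):
    """Return (source, filetype) for one input; a hypothetical helper."""
    parsed = urlparse(arg)
    if parsed.scheme in ("http", "https"):
        source, path = "url", parsed.path
    else:
        source, path = "path", arg
        if os.path.isdir(path):
            return source, "directory"
    ext = os.path.splitext(path)[1].lower()
    filetype = ext[1:] if ext in KNOWN_EXTENSIONS else "text"
    return source, filetype


for arg in inputs_or_default(["sales.csv", "https://example.com/data/items.json", "notes.log"]):
    print(arg, classify_input(arg))
```

The extension check is case-insensitive here, and a URL's query string is ignored when looking for the extension; both are choices of this sketch rather than documented behavior.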

Commands

Definitions of terms used in the help and documentation:

  • 'go': move cursor

  • 'move': change layout of visible data

  • 'show': put on status line

  • 'scroll': change set of visible rows

  • 'push': move a sheet to the top of the sheets list (thus making it immediately visible)

  • 'open': create a new sheet from a file or url

  • 'load': reload an existing sheet from in-memory contents

  • 'jump': change to existing sheet

  • 'drop': drop top (current) sheet

  • 'this': current [row/column/cell] ('current' is also used)

  • 'abort': exit program immediately

F1 opens the Help Sheet, which shows the available commands along with a brief description. This sheet can be searched, sorted, and filtered just like any other sheet.

Here are slightly better descriptions of some non-obvious commands:

  • g is the "global prefix": it always applies to the next command only, and its meaning depends on that command. It can mean "apply to all columns" (as with the regex search commands), "apply to selected rows" (as with delete), or "apply to all sheets" (as with q). The global_action column on the Help Sheet shows the specific way the global prefix changes each command.

  • = "add column expression" takes a Python expression as input and appends a new column, which evaluates the expression over the row.

  • Ctrl-S saves the sheet: the output type is determined by the file extension (currently .tsv and .csv)

  • R sets the source type of the current sheet. The current sheet remains until a reload (Ctrl-R).

  • When sheets are joined, the rows are matched by the display values in the key columns. Different numbers of key columns cannot match (no partial keys and rollup yet). The join types are:

    • &: Join all selected sheets, keeping only rows which match keys on all sheets (inner join)
    • +: Join all selected sheets, keeping all rows from first sheet (outer join, with the first selected sheet being the "left")
    • *: Join all selected sheets, keeping all rows from all sheets (full join)
    • ~: Join all selected sheets, keeping only rows NOT in all sheets (diff join)
  • Edits made to a joined sheet are by design automatically reflected back to the source sheets.
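
The four join types can be sketched over plain lists of dict rows. This is an illustration under stated assumptions, not VisiData's implementation: `join_sheets` is a hypothetical function, "display value" is approximated with `str()`, and when a sheet has duplicate keys the last row wins.

```python
# Hypothetical sketch of the &, +, * and ~ joins described above.
def join_sheets(sheets, key_cols, how):
    """Join `sheets` (lists of dict rows) on `key_cols`.

    Returns a list of (key, [row_or_None for each sheet]) pairs.
    """
    def key_of(row):
        # Rows match on the display (string) form of every key column.
        return tuple(str(row.get(col, "")) for col in key_cols)

    # One index per sheet: key -> source row (the same object, not a copy).
    indexes = [{key_of(row): row for row in sheet} for sheet in sheets]

    # All distinct keys, in order of first appearance.
    all_keys = []
    seen = set()
    for index in indexes:
        for key in index:
            if key not in seen:
                seen.add(key)
                all_keys.append(key)

    def count(key):
        return sum(key in index for index in indexes)

    if how == "&":    # inner: key present on every sheet
        keys = [k for k in all_keys if count(k) == len(sheets)]
    elif how == "+":  # outer: every key of the first ("left") sheet
        keys = [k for k in all_keys if k in indexes[0]]
    elif how == "*":  # full: every key from every sheet
        keys = all_keys
    elif how == "~":  # diff: keys missing from at least one sheet
        keys = [k for k in all_keys if count(k) < len(sheets)]
    else:
        raise ValueError("unknown join type: %r" % how)

    return [(key, [index.get(key) for index in indexes]) for key in keys]


left = [{"id": 1, "x": "a1"}, {"id": 2, "x": "a2"}]
right = [{"id": "2", "y": "b2"}, {"id": 3, "y": "b3"}]

for how in "&+*~":
    print(how, [key for key, _ in join_sheets([left, right], ["id"], how)])
```

Note that `2` and `"2"` match because keys are compared by display value. Each joined row holds references to the source rows rather than copies, which is one simple way an edit on the joined sheet could show up in the source sheets.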

Credits/Contributions

VisiData was created by Saul Pwanson <[email protected]>.

VisiData is currently under active development (as of January 2017).

VisiData needs lots of usage and testing to help it become useful and dependable. If you are actively using VisiData, please let me know! Maybe there is an easy way to improve the tool for both of us.

Also please create a GitHub issue if anything doesn't appear to be working right. If you get an unexpected error (on the status line), please include the full stack trace that you get with ^E.

Please contact me at the email address above if you would like to contribute in some other way.

License

VisiData is licensed under GPLv3.
