معرفی شرکت ها

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

تبلیغات ما

مشتریان به طور فزاینده ای آنلاین هستند. تبلیغات می تواند به آنها کمک کند تا کسب و کار شما را پیدا کنند.

مشاهده بیشتر

توضیحات

save and merge browser history and metadata from different browsers

ویژگی	مقدار
سیستم عامل	-
نام فایل	browserexport-0.2.9
نام	browserexport
نسخه کتابخانه	0.2.9
نگهدارنده	[]
ایمیل نگهدارنده	[]
نویسنده	Sean Breckenridge
ایمیل نویسنده	seanbrecke@gmail.com
آدرس صفحه اصلی	https://github.com/seanbreckenridge/browserexport
آدرس اینترنتی	https://pypi.org/project/browserexport/
مجوز	MIT

# browserexport [![PyPi version](https://img.shields.io/pypi/v/browserexport.svg)](https://pypi.python.org/pypi/browserexport) [![Python 3.7|3.8|3.9](https://img.shields.io/pypi/pyversions/browserexport.svg)](https://pypi.python.org/pypi/browserexport) [![PRs Welcome](https://img.shields.io/badge/PRs-welcome-brightgreen.svg?style=flat-square)](http://makeapullrequest.com) This: - locates and backs up browser history by copying the underlying database files to some directory you specify - can identify and parse the resulting database files into some common schema: ``` Visit: url: the url dt: datetime (when you went to this page) metadata: title: the <title> for this page description: the <meta description> tag from this page preview_image: 'main image' for this page, often opengraph/favicon duration: how long you were on this page ``` `metadata` is dependent on the data available in the browser (e.g. firefox has preview images, chrome has duration, but not vice versa) This currently supports: - [Firefox](https://www.mozilla.org/en-US/firefox/new/) - [Waterfox](https://www.waterfox.net/) - Firefox Android (pre-2020 schema and current [Fenix](https://github.com/mozilla-mobile/fenix)) - [Chrome](https://www.google.com/chrome/) - [Chromium](https://www.chromium.org/chromium-projects/) - [Brave](https://brave.com/) - [Vivaldi](https://vivaldi.com/) - [Arc](https://arc.net/) - [Safari](https://www.apple.com/safari/) - [Palemoon](https://www.palemoon.org/) This doesn't aim to offer a way to 'restore' your history (see [#16](https://github.com/seanbreckenridge/browserexport/issues/16) for discussion) This can probably extract visits from other Firefox/Chromium-based browsers, but it doesn't know how to locate them to `save` them ## Install `python3 -m pip install --user browserexport` Requires `python3.7+` ## Usage ### `save` ``` Usage: browserexport save [OPTIONS] Backs up a current browser database file Options: -b, --browser [chrome|firefox|safari|brave|waterfox|chromium|vivaldi|palemoon|arc] Browser name to backup history for --form-history [firefox] Browser name to backup form (input field) history for --pattern TEXT Pattern for the resulting timestamped filename, should include an str.format replacement placeholder -p, --profile TEXT Use to pick the correct profile to back up. If unspecified, will assume a single profile [default: *] --path FILE Specify a direct path to a database to back up -t, --to DIRECTORY Directory to store backup to [required] --help Show this message and exit. ``` Must specify one of `--browser`, `--form-history` or `--path` Since browsers typically remove old history over time, I'd recommend backing up your history periodically, like: ```shell $ browserexport save -b firefox --to ~/data/browser_history $ browserexport save -b chrome --to ~/data/browser_history $ browserexport save -b safari --to ~/data/browser_history ``` That copies the sqlite databases which contains your history `--to` some backup directory. If a browser you want to backup is Firefox/Chrome-like (so this would be able to parse it), but this doesn't support locating it yet, you can directly back it up with the `--path` flag: ```shell $ browserexport save --path ~/.somebrowser/profile/places.sqlite \ --to ~/data/browser_history ``` The `--pattern` argument can be used to change the resulting filename for the browser, e.g. `--pattern 'places-{}.sqlite'` or `--pattern "$(uname)-{}.sqlite"`. The `{}` is replaced by the browser name. Feel free to create an issue/contribute a [browser](./browserexport/browsers/) file to locate the browser if this doesn't support some browser you use. Can pass the `--debug` flag to show [`sqlite_backup`](https://github.com/seanbreckenridge/sqlite_backup) logs ``` $ browserexport --debug save -b firefox --to . [D 220202 10:10:22 common:87] Glob /home/sean/.mozilla/firefox with */places.sqlite (non recursive) matched [PosixPath('/home/sean/.mozilla/firefox/ew9cqpqe.dev-edition-default/places.sqlite')] [I 220202 10:10:22 save:18] backing up /home/sean/.mozilla/firefox/ew9cqpqe.dev-edition-default/places.sqlite to /home/sean/Repos/browserexport/firefox-20220202181022.sqlite [D 220202 10:10:22 core:110] Source database files: '['/tmp/tmpcn6gpj1v/places.sqlite', '/tmp/tmpcn6gpj1v/places.sqlite-wal']' [D 220202 10:10:22 core:111] Temporary Destination database files: '['/tmp/tmpcn6gpj1v/places.sqlite', '/tmp/tmpcn6gpj1v/places.sqlite-wal']' [D 220202 10:10:22 core:64] Copied from '/home/sean/.mozilla/firefox/ew9cqpqe.dev-edition-default/places.sqlite' to '/tmp/tmpcn6gpj1v/places.sqlite' successfully; copied without file changing: True [D 220202 10:10:22 core:64] Copied from '/home/sean/.mozilla/firefox/ew9cqpqe.dev-edition-default/places.sqlite-wal' to '/tmp/tmpcn6gpj1v/places.sqlite-wal' successfully; copied without file changing: True [D 220202 10:10:22 core:230] Running backup, from '/tmp/tmpcn6gpj1v/places.sqlite' to '/home/sean/Repos/browserexport/firefox-20220202181022.sqlite' [D 220202 10:10:22 save:14] Copied 1840 of 1840 database pages... [D 220202 10:10:22 core:246] Executing 'wal_checkpoint(TRUNCATE)' on destination '/home/sean/Repos/browserexport/firefox-20220202181022.sqlite' ``` For Firefox Android, backing up the database from [Fenix](https://github.com/mozilla-mobile/fenix/) (at `data/data/org.mozilla.fenix/files/places.sqlite`) requires a rooted Android phone. ### `inspect`/`merge` ``` Usage: browserexport inspect [OPTIONS] SQLITE_DB Extracts visits from a single sqlite database Provide a history database as the first argument Drops you into a REPL to access the data Options: -s, --stream Stream JSON objects instead of printing a JSON list -j, --json Print result to STDOUT as JSON --help Show this message and exit. ``` ``` Usage: browserexport merge [OPTIONS] SQLITE_DB... Extracts visits from multiple sqlite databases Provide multiple sqlite databases as positional arguments, e.g.: browserexport merge ~/data/firefox/*.sqlite Drops you into a REPL to access the data Options: -s, --stream Stream JSON objects instead of printing a JSON list -j, --json Print result to STDOUT as JSON --help Show this message and exit. ``` Logs are hidden by default. To show the debug logs set `export BROWSEREXPORT_LOGS=10` (uses [logging levels](https://docs.python.org/3/library/logging.html#logging-levels)) or pass the `--debug` flag. As an example: ```bash browserexport --debug merge ~/data/firefox/* ~/data/chrome/* [D 210417 21:12:18 merge:38] merging information from 24 sources... [D 210417 21:12:18 parse:19] Reading visits from /home/sean/data/firefox/places-20200828223058.sqlite... [D 210417 21:12:18 common:40] Chrome: Running detector query 'SELECT * FROM keyword_search_terms' [D 210417 21:12:18 common:40] Firefox: Running detector query 'SELECT * FROM moz_meta' [D 210417 21:12:18 parse:22] Detected as Firefox [D 210417 21:12:19 parse:19] Reading visits from /home/sean/data/firefox/places-20201010031025.sqlite... [D 210417 21:12:19 common:40] Chrome: Running detector query 'SELECT * FROM keyword_search_terms' .... [D 210417 21:12:48 common:40] Firefox: Running detector query 'SELECT * FROM moz_meta' [D 210417 21:12:48 common:40] Safari: Running detector query 'SELECT * FROM history_tombstones' [D 210417 21:12:48 parse:22] Detected as Safari [D 210417 21:12:48 merge:51] Summary: removed 3001879 duplicates... [D 210417 21:12:48 merge:52] Summary: returning 334490 visit entries... Use vis to interact with the data [1] ... ``` To dump all that info to JSON: ``` browserexport merge --json ~/data/browser_history/*.sqlite > ./history.json du -h history.json 67M history.json ``` Or, to create a quick searchable interface, using [`jq`](https://github.com/stedolan/jq) and [`fzf`](https://github.com/junegunn/fzf): `browserexport merge -j --stream ~/data/browsing/*.sqlite | jq '"\(.url)|\(.metadata.description)"' | awk '!seen[$0]++' | fzf` Merged files like `history.json` above can also be used as inputs files themselves, this reads those by mapping the JSON onto the `Visit` schema directly. If you don't care about keeping the raw databases for any other auxillary info like form or bookmark data and just want the url, visit date and metadata, you could use `merge` to periodically merge the bulky `.sqlite` files into a JSON dump: ```bash cd ~/data/browsing # backup databases rsync -Pavh ~/data/browsing ~/.cache/browsing # merge all sqlite databases into a single json file browserexport --debug merge --json * > '/tmp/browsing.json' # remove sqlite databases rm *.sqlite *.db # move merged data to database directory mv /tmp/browsing.json ~/data/browsing # test reading the merged data browserexport merge ~/data/browsing/* ``` I do this every couple months with a script [here](https://github.com/seanbreckenridge/bleanser/blob/master/bin/merge-browser-history), and then sync my old databases to a harddrive for more long-term storage ## HPI If you want to cache the merged results, this has a [module in HPI](https://github.com/karlicoss/HPI) which handles locating/caching and querying the results. See [setup](https://github.com/karlicoss/HPI/blob/master/doc/SETUP.org#install-main-hpi-package) and [module setup](https://github.com/karlicoss/HPI/blob/master/doc/MODULES.org#mybrowser). That uses [cachew](https://github.com/karlicoss/cachew) to automatically cache the merged results, recomputing whenever you backup new databases As a few examples: ``` $ hpi doctor -S my.browser.all ✅ OK : my.browser.all ✅ - stats: {'history': {'count': 721951, 'last': datetime.datetime(2021, 4, 19, 2, 26, 8, 29825, tzinfo=datetime.timezone.utc)}} ``` ``` # supports arbitrary queries, e.g. how many visits did I have in January 2020? $ hpi query my.browser.all --order-type datetime --after '2022-01-01 00:00:00' --before '2022-01-31 23:59:59' | jq length 50432 # how many github URLs in the past month $ hpi query my.browser.all --recent 4w -s | jq .url | grep 'github.com' -c 16357 ``` ## Library Usage To save databases: ```python from browserexport.save import backup_history backup_history("firefox", "~/data/backups") ``` To merge/read visits from databases: ```python from browserexport.merge import read_and_merge read_and_merge(["/path/to/database", "/path/to/second/database", "..."]) ``` If this doesn't support a browser and you wish to quickly extend without maintaining a fork (or contributing back to this repo), you can pass a `Browser` implementation (see [browsers/all.py](./browserexport/browsers/all.py) and [browsers/common.py](./browserexport/browsers/common.py) for more info) to `browserexport.parse.read_visits` or programatically override/add your own browsers as part of the `browserexport.browsers` namespace package. You can also use [`sqlite_backup`](https://github.com/seanbreckenridge/sqlite_backup) to copy your current browser history into a sqlite connection in memory, without ever writing to disk: ```python from browserexport.browsers.all import Firefox from browserexport.parse import read_visits from sqlite_backup import sqlite_backup db_in_memory = sqlite_backup(Firefox.locate_database()) visits = list(read_visits(db_in_memory)) # to merge those with other saved files from browserexport.merge import merge_visits, read_and_merge merged = list(merge_visits([ visits, read_and_merge(["/path/to/another/database.sqlite", "..."]), ])) ``` #### Comparisons with Promnesia A lot of the initial queries/ideas here were taken from [promnesia](https://github.com/karlicoss/promnesia) and the [`browser_history.py`](https://github.com/karlicoss/promnesia/blob/0e1e9a1ccd1f07b2a64336c18c7f41ca24fcbcd4/scripts/browser_history.py) script, but creating a package here allows its to be more extendible, e.g. allowing you to override/locate additional databases. TLDR: promnesia lets you explore your browsing history in context: where you encountered it, in chat, on Twitter, on Reddit, or just in one of the text files on your computer. This is unlike most modern browsers, where you can only see when you visited the link. Since [promnesia #375](https://github.com/karlicoss/promnesia/pull/375), `browserexport` is used in [promnesia](https://github.com/karlicoss/promnesia) in the `browser.py` file (to read any of the supported databases here from disk), see [setup](https://github.com/karlicoss/promnesia#setup) and [the browser source quickstart](https://github.com/karlicoss/promnesia/blob/master/doc/SOURCES.org#browser) in the instructions for more Eventually this project may be used in promnesia to replace the `browser.py` file ## Contributing Clone the repository and [optionally] create a [virtual environment](https://docs.python.org/3/library/venv.html) to do your work in. ```bash git clone https://github.com/seanbreckenridge/browserexport cd ./browserexport # create a virtual environment to prevent possible package dependency conflicts python -m venv .venv source .venv/bin/activate ``` ### Development/Testing To install, run: ```bash python3 -m pip install '.[testing]' ``` If running in a virtual environment, `pip` will automatically install dependencies into your virtual environment. If running `browserexport` happens to use the globally installed `browserexport` instead, you can use `python3 -m browserexport` to ensure its using the version in your virtual environment. After making changes to the code, reinstall by running `pip install .`, and then test with `browserexport` or `python3 -m browserexport` ### Testing While developing, you can run tests with: ```bash pytest flake8 ./browserexport mypy ./browserexport ```

نیازمندی

مقدار	نام
>=8.0	click
-	logzero
-	IPython
>=0.1.6	sqlite-backup
-	pytest
-	mypy
-	flake8

زبان مورد نیاز

مقدار	نام
>=3.7	Python

نحوه نصب

نصب پکیج whl browserexport-0.2.9:

pip install browserexport-0.2.9.whl

نصب پکیج tar.gz browserexport-0.2.9:

pip install browserexport-0.2.9.tar.gz