Many hyperlinks are disabled.
Use anonymous login
to enable hyperlinks.
50 most recent check-ins
2024-11-11
| ||
22:39 | Generate comprehensive sample of concordances for each word in the presence of unparliamentary Leaf check-in: 9c6b294d87 user: sam_hames tags: refactor-features-clusters | |
2024-11-04
| ||
06:50 | Update unparliamentary analysis to incorporate a nicer format with trendlines, a fixed seed for the clustering, and a spreadsheet format for annotations check-in: c4ddaa1d77 user: sam_hames tags: refactor-features-clusters | |
2024-10-23
| ||
05:33 | Example for pulling out some hansard speeches related to unparliamentary check-in: 0c5621195a user: sam_hames tags: refactor-features-clusters | |
2024-08-16
| ||
00:57 | Remove index method for pinning clusters check-in: e13620cf23 user: sam_hames tags: refactor-features-clusters | |
2024-08-11
| ||
04:29 | Basic creation of new clusters from selection of features from a single cluster check-in: 8d37793f74 user: sam_hames tags: refactor-features-clusters | |
2024-07-31
| ||
23:43 | Further tidy up the server endpoints and display a sample of all documents if no active query check-in: 3ec0cb81fb user: sam_hames tags: refactor-features-clusters | |
01:41 | Switch to two column layout for clusters and docs check-in: 17ff562a5a user: sam_hames tags: refactor-features-clusters | |
01:40 | Bugfix: incorrect links displayed for StackExchangeCorpus check-in: 0143b2fea0 user: sam_hames tags: refactor-features-clusters | |
2024-07-29
| ||
02:03 | Tidy up padding and margins in the header check-in: 3bfd7e3a13 user: sam_hames tags: refactor-features-clusters | |
01:35 | Make top menu sticky, inline the search bar into it on the main index page check-in: 0d2c2af100 user: sam_hames tags: refactor-features-clusters | |
00:21 | Implement web interface for new cluster creation workflow and reset check-in: e120e5b9dd user: sam_hames tags: refactor-features-clusters | |
2024-07-28
| ||
23:37 | Bugfix: filter out placeholder top_k results and features with 0 intersection with query check-in: 27a3e4b5a9 user: sam_hames tags: refactor-features-clusters | |
2024-07-25
| ||
03:14 | Add methods to count the number of documents and retrieve a bitmap of all doc_ids check-in: bd508ebb4d user: sam_hames tags: refactor-features-clusters | |
2024-07-23
| ||
09:37 | Get basic server functionality working with new cluster format check-in: af9469aa64 user: sam_hames tags: refactor-features-clusters | |
07:02 | Migrate the CLI and index more fully, including migrating more of the tests from index functionality to cluster functionality. check-in: ed7b20c356 user: sam_hames tags: refactor-features-clusters | |
02:56 | Initial implementation of clusters as separate from a single clustering run, and as the bridge towards queries as the unit of analysis check-in: b072e33232 user: sam_hames tags: refactor-features-clusters | |
2024-07-16
| ||
00:56 | Rename some remaining feature_id references to feature check-in: cf8b19fe51 user: sam_hames tags: refactor-features-clusters | |
00:52 | Remove feature_ids everywhere, reference features only be fully qualified (field, value) pairs throughout check-in: 4f48c07827 user: sam_hames tags: refactor-features-clusters | |
2024-07-12
| ||
05:38 | Make sure notebook working data is properly cleaned up before running as a test Leaf check-in: 6396226312 user: sam_hames tags: trunk | |
05:35 | Refactor schema migration to better group statements into logical units check-in: ec1a75260e user: sam_hames tags: trunk | |
2024-07-09
| ||
05:17 | Make sure static files will also be included in release artifacts Leaf check-in: 179b5e1ba6 user: sam_hames tags: ui-rework | |
05:16 | Merge trunk back into this branch check-in: aebed08950 user: sam_hames tags: ui-rework | |
05:12 | Bump version in preparation for next phase of work check-in: 648a96cdba user: sam_hames tags: trunk | |
05:06 | Bugfix: make sure templates are properly included in the distributed package check-in: 4929916c99 user: sam_hames tags: trunk, release, 0.6.1 | |
04:32 | Bump version for release check-in: bdef0c24c5 user: sam_hames tags: trunk, release, 0.6.0 | |
2024-07-08
| ||
06:17 | Pull styles out to their own static css file check-in: 830a7c6e2b user: sam_hames tags: ui-rework | |
2024-07-05
| ||
04:54 | Expand comment documenting rationale for the algorithm behaviour check-in: 7688bcfad3 user: sam_hames tags: trunk | |
04:50 | Tweak algorithm settings by making moves from larger to smaller clusters easier and smaller to large clusters harder. check-in: 00c02fb129 user: sam_hames tags: trunk | |
2024-06-18
| ||
04:34 | Bugfix: missing cluster_id for navigating from cluster view back tothe overview check-in: 8e57f79a4e user: sam_hames tags: trunk | |
2024-06-07
| ||
01:26 | Make styling more specific so matching feature styling does not interfere with document styling check-in: 10574cc6d1 user: sam_hames tags: trunk | |
2024-06-04
| ||
05:55 | Bump to a dev version to test the PyPI release check-in: 7f9440602f user: sam_hames tags: trunk, 0.6.0.dev1 | |
05:54 | Only check and apply format to the hyperreal/ and tests/ folder - otherwise if you have a virtualenv in the same directory it will try to check/format all your dependencies. check-in: b4f3921692 user: sam_hames tags: trunk | |
05:41 | Update homepage in preparation for a PyPI release check-in: ac9710586a user: sam_hames tags: trunk | |
00:47 | Handle slight format change in stackexchange tags - it's now pipe delimited. This change will work on both old and new data. check-in: 5a19fb8eab user: sam_hames tags: trunk | |
2024-05-26
| ||
04:59 | Make the working directory cleaner for testing notebooks and testing cleans up the notebook to a blank state at the end check-in: 0ccc904863 user: sam_hames tags: trunk | |
04:40 | Testing environment for notebooks and the first phase of the stackexchange-corpus notebooks check-in: 60f69bc0f6 user: sam_hames tags: trunk | |
03:45 | Update readme with more information check-in: df3abb35b3 user: sam_hames tags: trunk | |
2024-05-22
| ||
02:53 | Update StackexchangeCorpus to handle the full stackoverflow dataset properly check-in: a74a9b22d5 user: sam_hames tags: trunk | |
2024-05-18
| ||
11:34 | Merge migration tidy up for next phase refactoring of schema check-in: 39b773a856 user: sam_hames tags: trunk | |
11:32 | Fix linting and docs explanation Leaf check-in: 76a73343c5 user: sam_hames tags: tidy-migrations | |
11:27 | Move to a simpler linear migration steps approach, dropping support for migrating from versions older than 0.5.0 without intermediate check-in: 214c5ed1b0 user: sam_hames tags: tidy-migrations | |
11:18 | Generate different schema versions of indexes, not corpora check-in: 8b5e35246f user: sam_hames tags: tidy-migrations | |
11:12 | Undo mistake: corpora are not versioned, the old version is the same as the new version check-in: 390bd844ed user: sam_hames tags: tidy-migrations | |
11:02 | Fix minor linting issues check-in: a518f87665 user: sam_hames tags: tidy-migrations | |
11:01 | Prepare a plaintext corpus with schema version 10 and the old schema, so I can test tidying up migrations check-in: 4c1e8e9066 user: sam_hames tags: tidy-migrations | |
2024-05-14
| ||
23:19 | Move py7zr import to the only function where it's needed check-in: 96a19097b3 user: sam_hames tags: trunk | |
12:38 | Disable new pylint error that can't occur in that location check-in: ae5422be42 user: sam_hames tags: trunk | |
12:37 | Make py7zr an optional dependency for stackexchange only check-in: 4b94cad59f user: sam_hames tags: trunk | |
12:21 | Update the stackexchange corpus to work directly with the 7z archives from https://archive.org/download/stackexchange check-in: b5f18fb260 user: sam_hames tags: trunk | |
10:18 | Bugfix: incorrect doc fields for StackexchangeCorpus check-in: b8685bfee4 user: sam_hames tags: trunk | |