Many hyperlinks are disabled.
Use anonymous login
to enable hyperlinks.
50 most recent check-ins
|
2026-02-04
| ||
| 01:38 | Add test file for corpus interface! Leaf check-in: 8dc78cfb78 user: sam_hames tags: trunk | |
|
2026-01-21
| ||
| 23:56 | Back to next development release check-in: 2d39cbe18a user: sam_hames tags: trunk | |
| 23:48 | Bump version for a bugfix binder related release check-in: 94c4994342 user: sam_hames tags: trunk, release, 0.7.1 | |
| 23:41 | Move the binder specific configuration to the handier binder/ config folder check-in: 8b258407ef user: sam_hames tags: trunk | |
| 05:33 | Update indexing and display logic for conversations notebooks check-in: 16c3cacb2b user: sam_hames tags: trunk | |
| 05:32 | Update dependencies to the correct versions for the jupyter notebooks check-in: 7b9a5ecf79 user: sam_hames tags: trunk | |
| 05:32 | Update defaults for HTML document rendering - don't include the doc_key, as it will only be useful in some cases - better to leave this to customisation on the corpus object check-in: acde84ff2d user: sam_hames tags: trunk | |
| 05:31 | Bugfix: not using the correct logic to mark fields as range encoded based on indexed parameters check-in: 292c251100 user: sam_hames tags: trunk | |
| 03:24 | Cleanup and start on the next version check-in: 72078f640b user: sam_hames tags: trunk | |
| 03:11 | New release at last incorporating the substantial reworking of the interface and workflow check-in: aee8e0b75b user: sam_hames tags: trunk, release, 0.7.0 | |
| 02:55 | Draft notebook for conversation aware indexing of transcripts in word docs check-in: 67bd67da0f user: sam_hames tags: trunk | |
| 02:54 | Initial experimental setup for binderhub environment check-in: 562b229735 user: sam_hames tags: trunk | |
| 02:53 | Bugfix - note generating URLs correctly for main nav in jupyter notebook context check-in: 714befd1c6 user: sam_hames tags: trunk | |
| 01:44 | Tweak docstring for refine_clustering check-in: e9a53d065e user: sam_hames tags: trunk | |
|
2026-01-20
| ||
| 05:55 | Merge extensive refactoring of the data model and other changes so we're back on trunk based development check-in: 5ddcd4c673 user: sam_hames tags: trunk | |
| 04:42 | Add missing file :/ Closed-Leaf check-in: c9f2d5fb0e user: sam_hames tags: refactor-all | |
| 04:41 | Add additional information around construction of indexable docs to more finely encode what is and isn't range encoded check-in: 0984c7ea5a user: sam_hames tags: refactor-all | |
| 01:21 | Construct a term-frequency ordered index as a template for future impact scores This is tentative and will likely change in the future. check-in: 3647143ac2 user: sam_hames tags: refactor-all | |
|
2025-12-09
| ||
| 00:00 | Add utility method for refining multiple clusterings at once check-in: 1fb88e7ba3 user: sam_hames tags: refactor-all | |
|
2025-11-30
| ||
| 23:31 | Refactor clustering algorithm function enable efficient computation of multiple cluster refinements at the same time - this supports things like subdividing a clustering and generate multiple clusterings from different randomised starting points easily. check-in: 7ec95a80ea user: sam_hames tags: refactor-all | |
| 09:21 | Plot cross posting groups first rather than last check-in: 5490579d3c user: sam_hames tags: refactor-all | |
|
2025-11-27
| ||
| 22:10 | Also plot top cross posted groups in the main figure check-in: e3b52bfed4 user: sam_hames tags: refactor-all | |
| 02:44 | Track logarithm base 2 term frequency for positional feature occurences check-in: 2f9443860a user: sam_hames tags: refactor-all | |
|
2025-11-24
| ||
| 22:31 | Make a wide rather than long results table for twenty newsgroups check-in: f5bff605b3 user: sam_hames tags: refactor-all | |
| 05:55 | Handle lines containing only whitespace other than newlines in paragraph detector check-in: 5f7c11ff5d user: sam_hames tags: refactor-all | |
|
2025-11-21
| ||
| 05:44 | Add some experimental cluster refinement options check-in: c6f8ffc219 user: sam_hames tags: refactor-all | |
|
2025-11-19
| ||
| 23:26 | Pull out the clustering refining algorithm into a separate function that takes mmapped bitmap files as input - this is to support reuse in different contexts such as clustering of different kinds of bitmaps check-in: 42e97da243 user: sam_hames tags: refactor-all | |
| 04:15 | Refactor mmapping of bitmaps to take arbitrary combinations of features - lays the groundwork for making the clustering algorithm more reusable across different contexts check-in: d087649ced user: sam_hames tags: refactor-all | |
|
2025-11-17
| ||
| 01:51 | Properly check for lack of intersection when testing adding a feature to a new cluster check-in: acc38c94b3 user: sam_hames tags: refactor-all | |
|
2025-11-04
| ||
| 06:25 | Allow dissolving selected clusters by moving all features to their next best cluster check-in: 3f01c16046 user: sam_hames tags: refactor-all | |
|
2025-10-24
| ||
| 06:04 | Slightly tweak signature handling, and change clustering granularity for newsgroups check-in: 69d4cb2ffe user: sam_hames tags: refactor-all | |
| 02:17 | Wire cluster refinement into the web UI check-in: 61a6495f9c user: sam_hames tags: refactor-all | |
| 01:53 | Improve detecting and removing signatures from newsgroups check-in: 7bbfe1215a user: sam_hames tags: refactor-all | |
| 00:36 | Highlight features and clusters currently included in the query check-in: d216d15f24 user: sam_hames tags: refactor-all | |
|
2025-10-23
| ||
| 22:35 | Make sure a scrollbar isn't generated for the whole page on Chromium check-in: 7fe45ef5e2 user: sam_hames tags: refactor-all | |
| 22:34 | Wire up cluster splitting into the web UI check-in: 7abea1f94b user: sam_hames tags: refactor-all | |
| 06:56 | Add some basic HTML tables for visualisation to the twenty newsgroup dataset check-in: 8e096e7e22 user: sam_hames tags: refactor-all | |
|
2025-10-22
| ||
| 05:15 | Reduce default number of features shown for clusters check-in: fba4307d47 user: sam_hames tags: refactor-all | |
| 00:32 | Further refinements to the table layout, visual hierarchy and colour scheme check-in: c4b2f7e1d6 user: sam_hames tags: refactor-all | |
| 00:31 | Bugfix: not recalculating the number of features after update check-in: 9a8ab38228 user: sam_hames tags: refactor-all | |
|
2025-10-21
| ||
| 23:55 | Update table styles to be more compact with display: grid check-in: 181c3c3057 user: sam_hames tags: refactor-all | |
| 23:54 | Smarter initialisation for splitting clusters check-in: b9c45945b2 user: sam_hames tags: refactor-all | |
| 03:30 | Use correct anchor link when generating cluster navigation to next/previous check-in: b71fa8585e user: sam_hames tags: refactor-all | |
| 03:22 | Expose delete clusters through web ui, rank facets by hits check-in: a26dda1d12 user: sam_hames tags: refactor-all | |
|
2025-10-20
| ||
| 23:12 | Cluster navigation expands navigated cluster to show all terms, bugfix with select boxes covered by bars check-in: 108f643f36 user: sam_hames tags: refactor-all | |
| 23:06 | Change visualisation to superimposed bar chart check-in: 9a53eb8401 user: sam_hames tags: refactor-all | |
| 07:01 | Further refinement to feature table layouts check-in: d41ec5249f user: sam_hames tags: refactor-all | |
| 07:00 | Final clustering parameters for twenty newsgroups check-in: eb9921d65e user: sam_hames tags: refactor-all | |
|
2025-10-19
| ||
| 23:56 | Remove rowspanning field cells, de-emphasise repeating values instead check-in: 1ac2d23d4a user: sam_hames tags: refactor-all | |
|
2025-10-15
| ||
| 22:55 | Add cluster navigation/step by step check-in: aa7737f3f2 user: sam_hames tags: refactor-all | |