Hyperreal

Timeline
Login

Many hyperlinks are disabled.
Use anonymous login to enable hyperlinks.

50 most recent check-ins

2025-08-13
03:47
Make feature_clustering using a fixed random_state object, that can be directly set by the user for repeatable randomness over a series of operations Leaf check-in: 48bac0160b user: sam_hames tags: refactor-all
01:42
Tweak urls and highlight-features for the cluster drill-down view check-in: 46d668e44b user: sam_hames tags: refactor-all
01:31
Make feature clusters editable through the web UI check-in: 37491ee032 user: sam_hames tags: refactor-all
2025-08-12
07:39
Bugfix: not passing through max_workers correctly to index.rebuild check-in: e45dfbefec user: sam_hames tags: refactor-all
05:51
Experimental interface to make the features used in a query available to a corpus renderer for highlighting of results - this is a WIP and will probably change check-in: a482c62dd5 user: sam_hames tags: refactor-all
2025-08-11
23:40
Add footer links to drilldown into the selected cluster view check-in: a48bc2c3ca user: sam_hames tags: refactor-all
22:45
Refine display and alignment of quantities with headers check-in: cfeceb92d6 user: sam_hames tags: refactor-all
03:28
Refine layout and styling by enabling different sized columns, and showing both doc_counts and similarity marks aligned check-in: 2bc9befd80 user: sam_hames tags: refactor-all
2025-08-10
02:12
Show number of sampled and matching documents in header for matches check-in: d2b879ef66 user: sam_hames tags: refactor-all
02:12
Handle quoted lines consistently with new helper method check-in: 61b7ad181e user: sam_hames tags: refactor-all
01:27
Try a different CSS styling for a more compact display that uses more screenspace when available check-in: 039f273d79 user: sam_hames tags: refactor-all
01:26
Add navigation to next and previous clusters for the cluster drilldown view check-in: 28e3b54bde user: sam_hames tags: refactor-all
2025-08-08
07:37
Add an additional endpoint to handle drilling down into the full detail of a cluster - this displays the features in the selected cluster next to all other clusters, sorted by similarity and enables more specific drilldown. check-in: 5f1115dc48 user: sam_hames tags: refactor-all
2025-08-07
06:39
Calculate cluster facets in parallel using the background pool. Refactor how the db state is managed by ensuring it is always accessed through the HyperrealIndex object, rather than the previous convenience assignment - this assignment prevents a feature cluster plugin from being picklable/usable in a background pool. check-in: 0b16c7b9a0 user: sam_hames tags: refactor-all
05:49
On seconds thoughts, don't change the objective, just leave it how it is and come back to it later in a more systematic way check-in: a9cc808915 user: sam_hames tags: refactor-all
01:52
Display a sample of all documents on the cluster browser view with no selected feature/cluster check-in: 2241285f61 user: sam_hames tags: refactor-all
01:38
Web related fixes: pass through extra_css from the corpus, list docs properly in the cluster browser, also enable selecting the port to serve on check-in: 3476bcde9c user: sam_hames tags: refactor-all
01:35
Change algorithm to penalise according to the minimum of n_docs and hits - this will more heavily penalise clusters with a large number of long features. check-in: 30257ab088 user: sam_hames tags: refactor-all
2025-08-04
06:50
Add split_cluster_into functionality on the feature clustering check-in: 60e63f3366 user: sam_hames tags: refactor-all
05:50
Cleanup lint warnings for feature_cluster.py check-in: 64b8c80f3f user: sam_hames tags: refactor-all
05:33
Apply/lint formatting for import order check-in: 8820fc6831 user: sam_hames tags: refactor-all
05:31
Add new functionality for merging clusters, add test cases for cluster level operations, also fix a transaction management bug raised by doing this work check-in: 728c284789 user: sam_hames tags: refactor-all
02:35
Tidy up the docstrings for the ValueHandler check-in: 2f244af160 user: sam_hames tags: refactor-all
2025-07-30
03:53
Change handler interface so they have access to the corpus check-in: 1995e12ca2 user: sam_hames tags: refactor-all
2025-06-06
06:54
Add stub configuration for generating documentation using sphinx. This enables all-in-one-page HTML generation, includes version info that will help align it with fossil based publishing and linking, and also the facilities for including selected API docs are what I want. check-in: 81cdfc1b93 user: sam_hames tags: refactor-all
2025-06-04
04:16
Improve handling of quoted components in body of posts, handle jupyter notebook and script invocation of the server from the source notebook, CSS tweak, remove html_indexable_doc interface from the corpus check-in: 15d281e374 user: sam_hames tags: refactor-all
2025-04-25
05:19
Allow choosing top_k features from field_features check-in: fb7dec78e0 user: sam_hames tags: refactor-all
05:17
Allow pivoting by a range query in the web UI check-in: 85d57e5153 user: sam_hames tags: refactor-all
2025-04-18
12:14
Bugfix: handling range features with no lower bound correctly check-in: 56866bf229 user: sam_hames tags: refactor-all
2025-04-08
04:26
Flesh out the facted visualisation further to display more than one thing, and consistently with the clustering display check-in: 90c0b23fcb user: sam_hames tags: refactor-all
04:25
Bugfix: incorrectly handling the edge case for the edge of a range encoded literal value check-in: 240a0acd0e user: sam_hames tags: refactor-all
2025-04-07
10:51
Make the facets linked for filtering and drilling down check-in: c20acfc49c user: sam_hames tags: refactor-all
05:50
Randomly sample documents for display, refine filtering interface/helpers, and spike out a facet rendering view check-in: 47123c4ebb user: sam_hames tags: refactor-all
2025-04-06
22:55
Randomly subsample documents for display check-in: 0651ec0816 user: sam_hames tags: refactor-all
22:33
Styling tweak to prevent vertical scrollbars on cluster view check-in: ccd4cd6adc user: sam_hames tags: refactor-all
08:08
Full pivoting of the clusters and features check-in: 6d4ef27f80 user: sam_hames tags: refactor-all
06:06
Style a header element for clusters check-in: baa03ef7ce user: sam_hames tags: refactor-all
05:32
Make browse view linked for features check-in: 68805981c3 user: sam_hames tags: refactor-all
2025-04-05
03:55
Tweak spacing of HTML cluster layout check-in: e6b77e2662 user: sam_hames tags: refactor-all
2025-04-04
12:34
Preliminary cluster visualisation check-in: 0e80908836 user: sam_hames tags: refactor-all
10:34
Initial area-mark visualisation for counts, reorganisation of some of the web interface, and further feature clustering implementation check-in: 634b46addc user: sam_hames tags: refactor-all
2025-03-31
09:32
Allow saving clusters, prepare more of the visualisation elements check-in: a8e140e868 user: sam_hames tags: refactor-all
2025-03-25
04:39
Add an index to search by doc_count in inverted_index check-in: 76528a37b3 user: sam_hames tags: refactor-all
2025-03-24
02:19
Add display of matching documents to the field overview page check-in: fdc614705e user: sam_hames tags: refactor-all
2025-03-10
06:56
Allow extracting features for range encoded fields check-in: 3a889f0e5d user: sam_hames tags: refactor-all
06:33
Add missing file, allow rendering content as multiple independent columns check-in: 42039c4346 user: sam_hames tags: refactor-all
04:01
Tweak the table rendering - HyperrealIndex object should always return field values as the Python representation of the index. check-in: e1b986bc96 user: sam_hames tags: refactor-all
01:35
Render the index summary and a summary of the fields through the webserver component check-in: a87e8f6a42 user: sam_hames tags: refactor-all
2025-02-11
07:01
Get initial core of the feature clustering algorithm working in the new context check-in: 1e8c5c15c5 user: sam_hames tags: refactor-all
00:32
Add stub for web server, migrate more components to Features instead of FieldValues check-in: 8906699856 user: sam_hames tags: refactor-all