#indiewebcamp 2015-04-30

2015-04-30 UTC
#
tantek
hmm - I'm having trouble with a note article distinction
#
tantek
what started as a medium note (no pun intended), became a long note, then with lists (yes plural)
#
tantek
and it still has the tone of a quick "note", but now is starting to look/structure like a blog post / article.
#
tantek
and yet I'd almost rather prefer the informality of a long (semi-structured) note than do anything so formalizing as put a name/title on it
#
tantek
does anyone else ever have these issues when writing medium/long notes?
#
GWG
Good evening
#
tantek
evening GWG, any thoughts on the above ^^^
#
aaronpk
why is case insensitivity a different problem?
benwerd joined the channel
#
aaronpk
these are all problems with trying to map URLs to a filesystem
#
tantek
aaronpk - I'm still thinking on a solution for case insensitivity.
#
aaronpk
i haven't heard of a solution for the file+folder problem yet either
#
tantek
huh? I can put "/" in a folder name in OSX
#
tantek
not sure how it escapes it, but it "works" in the UI
#
aaronpk
that doesn't sound very portable, and doesn't work on the command line so probably also doesn't work from code, but haven't tried in code yet
#
GWG
If it is that long, I always switch to a note.
#
aaronpk
i wouldn't be able to put that on a linux filesystem to run on a web server though
#
tantek
huh? why not try the UI action then doing an ls in the terminal to see how it works?
#
aaronpk
it uh, has a ":" when I do ls
#
aaronpk
and `mkdir test:` results in a / in the UI
#
aaronpk
which means now I can't store files with ":" in the URL
#
tantek
hmm I can't put a ":" in the name of a folder in the Finder
#
tantek
still it solves your immediate real problem
#
tantek
oh easy hack, use "//" for a ":" - only the trailing single-slash "/" is used for folder names :)
#
tantek
now about those capital letters
#
tantek
actually, better idea
#
tantek
URL escape both ":" and any capital letters. Done
#
aaronpk
that's not gonna work on linux though
#
aaronpk
all of these sound like hacks, or like clever workarounds that are osx specific
#
tantek
":" -> %3A
#
tantek
"E" -> %45
#
tantek
all done :D
wolftune joined the channel
#
tantek
A-Z => %41-%5A
#
tantek
none of them are OSX specific
#
aaronpk
osx is the only way i can have both "test" and "test/" on the filesystem
#
tantek
you can't use ":" in the filename on other systems?
#
aaronpk
oh i guess i can name a folder "test:" on linux
#
tantek
as it is in the command line on OSX
#
tantek
no difference there
#
aaronpk
that might work
#
aaronpk
i'd better document the three problems again just to double check
#
tantek
just leave it to me to come up with short syntax hacks, whether for class names, or file / folder names ;)
#
aaronpk
of course URL escaping capital letters loses the readability of those
#
aaronpk
luckily most URLs are mostly lowercase
#
tantek
yeah, acceptable compromise for the rarer frequency of capital letters
#
tantek
see, 80/20 rule and everything
#
tantek
solving that kind of problem I find much easier than things like note/article distinction :/
cmhobbs joined the channel
#
GWG
I think I mispook
#
GWG
mispoke.
#
GWG
I'm off today it seems
#
GWG
If it is that long, I always switch from a note.
#
tantek
one consideration is the effect on POSSEing, specifically, how Bridgy Publish will treat it, e.g. when POSSEing to FB
#
tantek
specifically, it seems Bridgy Publish ignores semantic markup in an article like lists, which are often essential to convey the meaning of a post
#
aaronpk
we need one more rule
#
aaronpk
if the URL you're saving ends in /, then URL encode that
#
tantek
whereas if I do the whole thing with plain text and whitespace, then the list bullets / numbers and formatting are all done there, which Bridgy Publish (mostly) propagates
#
aaronpk
in order to be able to store 2015 2015/ and 2015/Germany
#
tantek
I do find that I have to edit the Bridgy Publish FB POSSE copy and manually add back the linebreaks
mdik joined the channel
#
tantek
aaronpk - yes that makes sense
#
tantek
perhaps worthy of documenting on /IndieArchive, and cite today's log as the reasnonining / derivation
#
aaronpk
this does mean this will always require code to serve these files back via http, since this won't work with apache or nginx filesystem serving
#
tantek
curious what snarfed / kylewm thinks in terms of what Bridgy Publish could/should do with both markup inside articles, and whitespace inside notes
#
tantek
aaronpk - indeed
#
aaronpk
lastly, regarding versioning and human-readability of the filesystem, i think it would make more sense to put the timestamp at the end
#
snarfed
fwiw bridgy publish uses html2text (https://github.com/Alir3z4/html2text) to convert html to text
#
aaronpk
archive/example.com/path/to/file/DDD/SSS or file.DDD.SSS
#
snarfed
i'd love to support more kinds of formatting, but i'm not very interested in implementing it myself
#
tantek
aaronpk - I think that results in more folders :(
#
aaronpk
otherwise content from example.com could be spread around any number of DDD/SSS folders
#
GWG
snarfed: Got a moment?
#
snarfed
GWG: just a moment, sure
#
GWG
snarfed: Got a chance to look at that plugin?
#
tantek
do I need to add "Got a moment?" to CommProtocols? ;)
#
snarfed
GWG: context? which plugin?
#
GWG
tantek: I flaunt social conventions.
#
aaronpk
i'm imagining asking myself this later: "where is the latest version of X" or "how many versions of X do I have"
#
snarfed
tantek: lol probably yes
#
GWG
snarfed: The Indie-Webactions one?
#
snarfed
GWG: ah. you mean, can i install and try it?
#
snarfed
not now but soon!
#
tantek
aaronpk I believe archivists keep things by year in general
#
GWG
snarfed: I continue to try to get to stable.
#
tantek
that is, the question of, what sites did I reference / archive in year / day x?
#
aaronpk
archivists also came up with the WARC format so...
#
tantek
is more interesting / frequent than "where is the latest version of X" or "how many versions of X do I have"
#
aaronpk
in practice i have needed to find a file on disk in order to delete the cached version more often than i have ever asked my self "what did I reference in Y"
#
aaronpk
"what sites did I reference in Y" can almost as easily be answered by just reading my web pages from that year
#
GWG
I'm still trying to figure out...Portland or Edinburgh
#
tantek
aaronpk - another problem - putting the DDD/SSS at the end breaks the URL path
#
tantek
into two pieces
#
aaronpk
could be . instead of /
#
tantek
since you end up putting the filename.ext inside DDD/SSS :(
#
tantek
that's the other reason I remember
#
aaronpk
no, i would tack that on to the end in either case
#
tantek
I wanted to keep the paths intact
#
aaronpk
remember this is the filesystem representation, which has nothing to do with the URL it's served from
#
tantek
which works really well for the *typical* case
#
tantek
of lowercase URLs
#
aaronpk
so we can still use the URL format documented on /IndieArchive
#
aaronpk
it's just a matter of mapping that URL to a filesystem path
#
tantek
hmm - I'm reconsidering "tag/" for folders
#
tantek
wondering if "tag." for the file would be better, and "tag" for the folder
#
tantek
the advantage being, that the directory structures would actually map to URLs
#
tantek
like relative paths in the files would actually work "locally"
#
tantek
I like inspectability of such systems
#
tantek
e.g. being able to just double-click a .html file in a folder, and have it find relative paths to .css files etc.
#
tantek
IMO inspectable archives are far more reliable
#
aaronpk
that's a whole different problem i think
#
tantek
as in, inspectable *cross-platform* *without* having to *always* run custom code
#
aaronpk
i wouldn't count on that working most of the time, based on my experience trying to archive several websites
#
tantek
no not a different problem at all
#
tantek
inspectability is key to longevity
#
tantek
even if it's only 80%
#
aaronpk
it is in practice, because people have css/js files on other domains, or with weird characters in them all the time
#
tantek
not "all the time"
#
aaronpk
100% of the sites i archived had this happen
#
tantek
that's a horrible mischaracterization. I'd say 80/20 is *same domain*
#
tantek
except external libraries like jquery and webfonts
#
aaronpk
anyway gotta run to dinner, back online later!
#
Loqi
slack/snarfed: btw tantek sorry, didn't mean to be so brusque. I'd happily accept PRs against html2text!
#
tantek
ah interesting!
#
tantek
ok for the article with markup -> text for FB etc.
#
tantek
e.g. serializing various semantic HTML elements into default presentation. p blockquote ol ul li
snarfed, KevinMarks, KevinMarks__, yakker, j12t and KartikPrabhu joined the channel
#
kylewm
tantek: how do you think long notes with whitespace should look in a reader?
#
kylewm
or just notes with whitespace, length is irrelevant
#
kylewm
my impression is that your use of white-space: pre-wrap; is somewhat unorthodox?
#
tantek
kylewm: it's not unorthodox at all, but rather following a pattern established by both Twitter and Facebook
#
tantek
what is a note?
#
Loqi
A note is a post that is typically short unstructured* plain text, written & posted quickly, that has its own permalink page https://indiewebcamp.com/note
#
tantek
kylewm: plenty of research and analysis for your review: https://indiewebcamp.com/note#Whitespace
#
tantek
this is perhaps a key distinguishing and useful factor of explicitly typing notes vs. articles - notes can be expected to preserve whitespace, and auto-link / auto-embed
#
kylewm
looking at twitter css now, i wouldn't have guessed that's how they did linebreaks. interesting!
#
tantek
whereas articles are expected to have explicit markup instead of preserving whitespace and auto-linking / embedding
#
tantek
kylewm: yeah I tried to be pretty thorough before implementing :)
#
tantek
looks like I started documenting research and thoughts about it just over 2 years ago! https://indiewebcamp.com/wiki/index.php?title=note&direction=next&oldid=1803#White_space
#
kylewm
so your photos in notes are autolinked, and not embedded with <img> markup?
#
tantek
that's right
#
tantek
when authoring I "just" drop in a .jpg URL
#
tantek
knowing that it will "just work"
#
tantek
the way that Slack does for example
#
kylewm
does anyone else in IWC have pure-plaintext notes like that?
#
tantek
I've also brainstormed a bit about how I would auto-link plain text image URLs with alt text, to other URLs etc., but have not expanded CASSIS auto_link to do that yet: http://tantek.pbworks.com/w/page/59905776/Markdown#Hyperlinkswithlinktext
#
tantek
ugh sorry about that silo URL
#
tantek
better: tantek.com/w/Markdown#Hyperlinkswithlinktext
#
tantek
kylewm: AFAIK - *everyones* plain text notes work that way
#
kylewm
my notes are Markdown so linebreaks turn into <br>s etc.
#
tantek
well for those that actually built a separate "note" type of post, rather than just hacking notes as title-less articles in their existing blog posting system
#
tantek
yes it was less work to use white-space prewrap and it was good enough for twitter
snarfed joined the channel
#
tantek
and FB too - when I was manually POSSEing to FB - I could just copy/paste my notes directly into it and it would "do the right thing" - no markup needed
#
kylewm
any suggestion on how to support this in Woodwind? use whitespace pre-wrap if there's no title?
#
tantek
yes, per "notes can be expected to preserve whitespace, and auto-link / auto-embed"
#
kylewm
well, your post has the links already autolinked and the photos already autoembedded
#
tantek
CASSIS auto_link is smart enough to not doubly do so
#
tantek
you can call it twice on the same input and you get the same thing
#
tantek
the bigger challenge is p-content vs e-content
#
tantek
kylewm: rather than "if there's no title", please use this Note Type Algorithm: http://indiewebcamp.com/note#Note_Type_Algorithm (and or report any problems you find with it)
#
tantek
sorry what I meant was auto_link(x) == auto_link(auto_link(x))
#
kylewm
yeah that's what i mean, if there's no p-name which is a title separate from the content
#
kylewm
i use a slightly different algorithm in mf2util, but it's based on this one
#
tantek
anyway - the "is this a note" algorithm is more complex than just "no title" that's all
#
tantek
hence I keep conditionalizing it "for notes" or "if it's a note"
#
tantek
rather than by some specific attribute
#
tantek
I'm really curious how people see a FB post that you have explicitly hidden from your timeline/profile
#
tantek
(evidence that some are seeing it - likes on it)
#
tantek
kylewm: re: your use of markdown for your notes, please add that here! http://indiewebcamp.com/note#markdown_or_plain_text
#
kylewm
i see p3k also uses pre-wrap, not sure how i missed this detail
#
tantek
(especially since we have a different approach - hence encouragement of documentation to show diversity of approaches)
#
KartikPrabhu
would appreciate testing/feedback on my UI to get the fragmentioned URL of selected text: https://kartikprabhu.com/static/demo/fragmention.html
#
kylewm
barnabywalters is on team <p></p> though!
#
kylewm.com
edited /note (+244) "/* markdown or plain text */ add myself"
(view diff)
#
kylewm
KartikPrabhu: really nice. i'd sorta like if it rounded off to the nearest word if my selection is a little sloppy
#
KartikPrabhu
aah I see
#
KartikPrabhu
so drop trailing spaces and punctuations?
#
kylewm
looking at Fragmentions for poets to see if it uses the punctuation in a meaningful way
#
KartikPrabhu
also I'm using the latest syntax with one # and %20 escaped spaces, since if this works the space -> + is not needed
#
Loqi
slack/tantek: Note that space to + is only really valid in ? Query params
#
kylewm
yeah I think this UI is really nice. although the document sort of illustrates that paragraph level quoting is not always specific enough
#
KartikPrabhu
tantek: yes, hence the new syntax preference
#
KartikPrabhu
kylewm: true! but that is a fragmention.js issue that can be very independently fixed
#
KartikPrabhu
also my code does not check if the text is first occurence or not, which might make it fail
danlyke joined the channel
#
kylewm
KartikPrabhu: although it does give you immediate feedback if your selection fails
#
kylewm
one of the first ones i tried, there was an earlier instance
#
kylewm
actually i thought that was pretty cool :)
#
KartikPrabhu
side effect of not trying to do too much :P
danlyke, LCyrin and j12t joined the channel
j12t, lukebrooker and KevinMarks joined the channel
#
KevinMarks
Does blocking googlebot block appengine fetch?
#
KevinMarks
Kylewm I'm using the same note technique for app.willsomeone.com
#
KevinMarks
Tantek, have you looked at inlining tweets or other note urls like you do with images?
#
KevinMarks
I know aaron does that with twitter urls
elf-pavlik joined the channel
#
Loqi
slack/snarfed: KevinMarks_: re googlebot and app engine fetch, it shouldn't
glennjones, j12t, KevinMarks__ and nloadholtes joined the channel
#
notizblog.org
edited /User:Notizblog.org (+90) "/* related articles (print) */"
(view diff)
loic_m and pfefferle joined the channel
#
@DataC5155
Most of the main site is finished, but all of the product pages still need to be done. #indieweb #indiedev https://twitter.com/oSoftDev/status/593667815007191040
(twitter.com/_/status/593668393619820544)
#
@TheIndieSloth
RT @DataC5155: Most of the main site is finished, but all of the product pages still need to be done. #indieweb #indiedev https://t.co/aRK…
(twitter.com/_/status/593668437710348288)
#
@IndieGameDevBot
RT @DataC5155: Most of the main site is finished, but all of the product pages still need to be done. #indieweb #indiedev https://t.co/aRK…
(twitter.com/_/status/593668589833576448)
#
@GameDevRobot
RT @DataC5155: Most of the main site is finished, but all of the product pages still need to be done. #indieweb #indiedev https://t.co/aRK…
(twitter.com/_/status/593668869358788608)
#
@digisim
RT @benwerd: This piece about Known makes me so happy. Owning your space is about being free. http://audreywatters.com/2015/04/29/reclaim-known/ #indieweb #reclaimyour…
(twitter.com/_/status/593669017119907840)
Jihaisse and j12t joined the channel
#
pfefferle
good morning
tantek, pfefferle and j12t joined the channel
j12t_, petermolnar and jonnybarnes joined the channel
#
voxpelli
good morning!
#
petermolnar
good morning
tilgovi, Sebastien-L, j12t, adactio, LynnCyrin, evalica and j12t_ joined the channel
#
acegiak
yo, if I were gonna show someone one youtube video to introduce them to the indieweb movement what should I show them?
#
pfefferle
acegiak if vimeo is ok too… https://vimeo.com/77352414
#
acegiak
pfefferle: yeah definitely thanks!
squeakytoy, stream7, eschnou, elima and LauraJ joined the channel
LauraJ, frzn, elima and KevinMarks joined the channel
#
rhiaro
adactio: I couldn't make 20/21 June in Brighton but could do 11/12 July
#
rhiaro
!tell barnabywalters: less likely I could do 11/12 July in Berlin now (but not impossible); if there was one in Brighton then instead I'd go to that
#
Loqi
Ok, I'll tell him that when I see him next
#
petermolnar
I'd prefer a Brighton one as well ( or an Amsterdam one )
#
petermolnar
oh, there is one in Brighton, I didn't realize that
KevinMarks__ joined the channel
#
rhiaro.co.uk
edited /2015 (+28) "/* Candidate Cities */"
(view diff)
stream7_ joined the channel
#
Loqi
[mention] Pierre Ozoux posted 'Introducing OuiShareLabs Camp 3 min read Introducing OuiShar...' linking to http://indiewebcamp.com/store (https://microblog.pierre-o.fr/2015/introducing-ouisharelabs-camp)
#
voxpelli
petermolnar: add yourself under the Brighton one in the wiki: https://indiewebcamp.com/2015#Candidate_Cities :)
friedcell joined the channel
#
petermolnar.eu
edited /2015 (+25) "/* Candidate Cities */"
(view diff)
#
petermolnar.eu
edited /2015 (+0) "/* Candidate Cities */"
(view diff)
#
petermolnar
wtf, why can't I add my username?
#
petermolnar.eu
edited /User:Petermolnar.eu (+17) "/* used silos */"
(view diff)
#
petermolnar.eu
created /petermolnar (+32) "Redirected page to [[User:Petermolnar.eu]]"
(view diff)
#
petermolnar.eu
edited /2015 (-3) "/* Candidate Cities */"
(view diff)
#
petermolnar.eu
created /Template:petermolnar (+146) "Created page with "<span class="h-card">{{sparkline|https://s.gravatar.com/avatar/1915b220dfe0cc56209cb4d11b389383?s=128}}[[User:Petermolnar.eu|Peter Molnar" target="_blank">User:Petermolnar.eu|Peter" target="_blank">https://s.gravatar.com/avatar/1915b220dfe0cc56209cb4d11b389383?s=128}}[[User:Petermolnar.eu|Peter Molnar]]</span>""
(view diff)
mlncn and KevinMarks joined the channel
KevinMarks__, LauraJ and LaurieJ joined the channel
#
@g16n
@iandevlin Can you recommend some place to stay in Düsseldorf (IndieWebCamp and btconf)?
(twitter.com/_/status/593754945695981568)
Erkan_Yilmaz, LauraJ, parzzix, danlyke, KevinMarks, j12t, bupkes, snarfed, KevinMarks__ and fourtonfish joined the channel
#
@hmans
@mbrochh Es ist ein auf IndieWeb-Ideen aufsetzendes, verteiltes Social Network. Es ist anders/besser als Diaspora. Details per Mail?
(twitter.com/_/status/593772928333979648)
eschnou, glennjones, zero-gravitas, KevinMarks, KevinMarks__, chalettu, tantek, KevinMarks___, j12t, chalettu_, parzzix, tvn, csarven, friedcell, AcidNerd, elima, wolftune, evalica, danlyke and snarfed joined the channel
#
aaronpk
kylewm: have you considered showing the syndication URLs on posts in woodwind?
j12t, todrobbins, yakker, mlncn and snarfed joined the channel
fourtonfish and snarfed joined the channel
#
aaronparecki.com
edited /IndieArchive (+1711) "add notes on how to store this on disk"
(view diff)
#
aaronparecki.com
edited /IndieArchive (+118) "/* Storage */ nowiki"
(view diff)
#
aaronparecki.com
edited /IndieArchive (+0) "/* Storage */ caps"
(view diff)
#
aaronpk
oh i didn't include anything about timestamps yet
indie-visitor joined the channel
#
Loqi
Welcome, indie-visitor! Set your nickname by typing /nick yourname
#
aaronpk
false positive?
#
aaronpk
oh hey it says "see original" on it!!
#
snarfed
false positive for PSC extractors since it ends in (10.10 Yosemite)
#
snarfed
the see original is definitely nice though!
#
snarfed
Computers R Dumm
todrobbins joined the channel
#
tantek
snarfed, 10.10 is not a valid TLD :P
#
aaronpk
actually 10.10 is shorthand for the IP address 10.0.0.10
#
tantek
"Permashort citations are the parenthesized ccTLD and path" http://indiewebcamp.com/PSC##parenthesized
#
ben_thatmustbeme
10.10 isn't a valid URL?
#
tantek
snarfed, you're welcome to re-use the regex for ccTLDs from CASSIS.js auto_link :D
almereyda joined the channel
#
snarfed
tantek: oh god
#
aaronpk
it's beefy
#
snarfed
thank you so much, but…um…hey look over there!
#
snarfed
runs away
#
aaronpk
it also doesn't work wih the new gTLDs
#
tantek
deliberately
#
tantek
I meant it when I said **ccTLDs**
#
snarfed
don't get me wrong, cassis itself is great, i just really don't want to maintain a copy of that if i can avoid it
#
tantek
what are the chances that new countries will be introduced? ;)
#
snarfed
when i worked on a payment processing system that supported most of the world for a few years, it happened roughly twice a year :P
#
snarfed
(counting adds, removes, and changes)
#
snarfed
same w/currencies
#
aaronpk
tantek: did you get a chance to look at this? I tried writing up the discussion from yesterday https://indiewebcamp.com/IndieArchive#Storage
#
tantek
currencies are worse than countries
#
tantek
aaronpk - yeah - looks good - though what did you think of keeping foldernames the same (no : / ) and using an trailing "." for extensionless filenames like "tag" ?
#
aaronpk
would you also append a "." to filenames like "styles.css"?
#
aaronpk
i think you'd have to in order to be consistent
#
tantek
oh because that could be a folder name?!?
#
aaronpk
otherwise you're back in the same boat, where a folder that has a "." can't be used as a filename (example.com/foo.css and example.com/foo.css/bar)
#
aaronpk
and i've actually seen that happen
#
tantek
heck, example.com :P
#
kylewm
aaronpk: syndication posts on urls in woodwind -- right now I show them if the syndicated post was also found by woodwind... do you think it would be useful to show them all the time?
#
aaronpk
tantek: haha actually the archive itself is going to have URLs like that
#
tantek.com
edited /IndieArchive (+81) "/* Origins */ found citation for the OSCON session that inspired this discussion"
(view diff)
#
kylewm
snarfed: tantek: I use cassis to count characters in my UI and my own python regex to shorten the actual tweet text -- bit me the other day when I was writing about hub.mode and hub.url. cassis correctly ignored them, but my code thought they were urls
#
tantek
pretty sure that's why twitter gave up on auto-linking plain ccTLDs
#
tantek
which is why PSC work at all - they depend on that one neat trick
#
tantek.com
edited /IndieArchive (-7) "/* Origins */ -dup 2011"
(view diff)
#
aaronpk
kylewm: i don't know if you've seen the hackernews feed in your logs, but I add syncation URLs for the posts pointing to the HN URL, and it would be useful to show those in woodwind
#
kylewm
that's interesting! do you think that's an overload of "syndication", or not?
barnabywalters joined the channel
#
aaronpk
depends on whether syndication is meant to mean syndicated by the author of the post
#
aaronpk
I've submitted my own posts to HN before, and included the syndication url to the HN version on my post
#
aaronpk
and I've seen many other HN posts link to the HN copy, whether or not it was submitted by the author
KevinMarks and Erkan_Yilmaz joined the channel
#
aaronpk
tantek: another thing I was thinking is that it'd be useful to be able to keep the HTTP headers that were part of fetching the URL (in order to preserve content-type or modified date for example)
#
aaronpk
so I was thinking about storing the headers in a file alongside the page, like page.headers
#
aaronpk
at which point I could store the page in page.data, and then folders could just be folders without the ":"
#
barnabywalters
aaronpk: taproot/archive stores the headers in a .txt file with the same name as the .html file
#
Loqi
barnabywalters: rhiaro left you a message 5 hours, 55 minutes ago: less likely I could do 11/12 July in Berlin now (but not impossible); if there was one in Brighton then instead I'd go to that http://indiewebcamp.com/irc/2015-04-30/line/1430392772992
#
aaronpk
barnabywalters: do you store css or image files too?
#
barnabywalters
nope, only HTML so far
#
aaronpk
i'm trying to make this work for all kinds of files
#
barnabywalters
my archives are quite big enough without storing a bunch of CSS, much less images!
#
aaronpk
i'm at 400mb now, but if I gzip that it'll be a lot smaller
#
barnabywalters
here’s what the top level of my archive of your site looks like: https://waterpigs.co.uk/img/screen-shot-2015-04-30-at-171613.png
Erkan_Yilmaz joined the channel
#
aaronpk
only one version of my home page?
#
barnabywalters
aaronpk: yeah, I very rarely link to your homepage
#
barnabywalters
if I did more often then there would be more versions, as microformats changes will trigger a new version creation
#
tantek
but it's changed so much!
KartikPrabhu joined the channel
#
barnabywalters
when I was building taproot/archive I looked into using a zip archive as the fake filesystem. It has a bunch of benefits including vastly reduced filesize, fairly easy to inspect (just unzip) and it works like a key-value based file storage system, where trailing slashes are significant and any character can be used in the keys
#
barnabywalters
the problem with that is that as soon as you unzip it, the keys break (IIRC — I was experimenting with this a looong time ago)
#
aaronpk
yeah you'd have the case sensitivity and weird-chars-in-filenames problem when you unzip it
#
barnabywalters
yeah, that’s what I was going to use
#
barnabywalters
there’s also the danger that it could become corrupt more easily than a regular filesystem
#
aaronpk
interesting
#
aaronpk
indeed
#
barnabywalters
but that could be mitigated by regularly unzipping the active archive onto long-term backup media
#
barnabywalters
doesn’t solve the file path characters/case sensitivity issues though
#
aaronpk
i think if i used that, i'd still want to normalize the filenames using the rules i just added to /IndieArchive
j12t joined the channel
#
barnabywalters
aaronpk: looking over your algorithm, I can’t figure out if it gracefully handles treating http://example.org:80/path/to/file and http://example.org/80/path/to/file as separate URLs
#
loqi.me
created /EtherCalc (+102) "prompted by tantek and dfn added by tantek"
(view diff)
#
aaronpk
ooh damn edge cases
#
barnabywalters
IIRC that occurred to me when I was building taproot/archive, but I couldn’t figure out a good solution so ignored it
#
aaronpk
oh actually as described they would not conflict, but my examples don't match the description!
#
barnabywalters
oh, would it be handled by the trailing escaped slash thing?
#
aaronpk
yeah all folders get a colon appended, including the domain name "folder"
#
aaronpk
so it'd be http/example.org:80:/path:/to:/file and http/example.org:/80:/path:/to:/tile
#
aaronpk
however my intent was not to include the colon on the domain name, but maybe it's actually necessary after all
#
barnabywalters
those folder names with slashes in are creepy
#
aaronpk
that's just OSX rendering : as /
#
barnabywalters
I know, but it’s WEIRD
#
tantek.com
edited /EtherCalc (+215) "open source, Node.js based"
(view diff)
#
barnabywalters
so my vaguely indieweb project yesterday evening and this afternoon was getting http://radio.waterpigs.co.uk set up to do live streamed audio on my domain using icecast2
#
aaronpk
oh wait a sec
#
tantek.com
edited /node (+0) "-r"
(view diff)
#
aaronpk
the first url escape takes care of the :80
#
barnabywalters
right now you can hear uninteresting traffic and typing noises from my office: http://radio.waterpigs.co.uk/live
#
tantek
barnabywalters: have you told adactio about this? I bet he'd be interested, per huffduffer and all that.
#
barnabywalters
tantek: nope, it only started working properly 30 minutes ago :)
#
snarfed
icecast! oh man, blast from the past
#
snarfed
winamp era
#
barnabywalters
icecast2 was super easy to set up on my server, but getting apache to proxy requests through was a bit trickier
#
tantek
what is icecast2?
#
barnabywalters
snarfed: ha ha yeah, and the web UI uses XSLT!
#
barnabywalters
so if I type Icecast2 (different capitalisation) will loqi handle it correctly?
#
aaronpk
i don't remember
#
barnabywalters
okay, I’ll do lower case and move afterwards to cover all the bases
#
tantek
try it and find out!
#
barnabywalters
Icecast2 is an open source audio streaming server http://icecast.org/
#
loqi.me
created /Icecast2 (+96) "prompted by tantek https://indiewebcamp.com/irc/2015-04-30/line/1430415340898 and dfn added by barnabywalters"
(view diff)
#
snarfed
reminds me of one of the few windows apps i ever wrote, https://snarfed.org/remote
#
barnabywalters
apparently it likes it
#
aaronpk
snarfed: wow
#
tantek
gives Loqi a capitalized word.
#
Loqi
hearts the capitalized word.
#
snarfed
"blaize.stanford.edu" oh man memories
sparveri1s joined the channel
#
kylewm
wow, this is hilarious. someone on my dorm wrote an aggregator for everyone running Winamp with the SpyAmp plugin, so you could see what everyone else on the floor was listening to
natwelch, tilgovi, revere, KartikPrabhu and todrobbins joined the channel
#
aaronparecki.com
edited /IndieArchive (+213) "/* Storage */ add screenshot"
(view diff)
#
barnabywalters
so now I’m trying to figure out a way of automatically recording every icecast broadcast I do, storing it in an archive by mountpoint (URL) and datetime
#
barnabywalters
there doesn’t seem to be any way of conveniently hooking into icecast though
#
barnabywalters
but as I’m proxying requests to it through apache, maybe there’s some apache feature I can (ab)use
acegiak and zero-gravitas joined the channel
#
barnabywalters
ah, and there’s a playlist log file which I could watch for changes
#
barnabywalters
I wonder if there’s already a tool for shimming logfiles into pub/sub type event dispatcher systems
#
aaronpk
oh you could even create a symlink from the plain filename to the latest versioned filename
#
barnabywalters
aaronpk: hm, I actually hadn’t thought about making them public, but that would make total sense if I did
#
barnabywalters
yeah, SOURCE /live log lines turn up whenever a broadcast is started or stopped
#
barnabywalters
aaronpk: that’s certainly robust, but the lack of correct file extensions makes it much less observable
#
aaronpk
"correct" file extensions is a losing battle
#
barnabywalters
I also initially tried .headers and got fed up of not being able to hit space and quicklook at them
#
barnabywalters
s/correct/observable
tvn joined the channel
#
aaronpk
the problem is a URL that has no file extension might be an image, or might be a css file, and there's no way to tell from the URL
#
aaronpk
or css files might end in a query string and lots of numbers styles.css?v=18345
#
aaronpk
or a URL that ends in .png might not be a png (e.g. https://indiewebcamp.com/File:indiearchive-example-files.png
#
barnabywalters
I strip query strings from URLs, but get the feeling you want your archiver to be less opinionated :)
#
snarfed
kylewm: funny. kind of the opposite of https://snarfed.org/synchronizing_mp3_playback
#
KartikPrabhu
is there no way to use some sort of mime-type?
#
aaronpk
KartikPrabhu: that kind of thing tends to be filesystem-dependent
#
KartikPrabhu
aaronpk: no i mean use the mime-type of the request to decide the file extension?
#
KartikPrabhu
s/request/web request
#
Loqi
KartikPrabhu meant to say: aaronpk: no i mean use the mime-type of the web request to decide the file extension?
#
aaronpk
that sounds very prone to accidental clobbering on the filesystem
#
barnabywalters
you could maintain a list of the most common mime types and their equivalents, and use .data for rarer files
#
aaronpk
i'm pretty sure i could come up with a list of examples that would be impossible to store that way
#
aaronpk
another problem is relying on a mime type to determine path means you can't determine the path based only on the URL, so it becomes hard to programmatically find things later
#
KartikPrabhu
valid point ^
#
aaronpk
there's always going to be a tradeoff between the inspectability of the files vs robustness vs assumptions made about ppls URLs
#
aaronpk
with spiderpig, I made several assumptions that allowed me to create files on disk that can be served directly by a web server and result in the same website. however i had to do things like force every page to end in a slash, adding redirects that weren't on the original site
#
aaronpk
but that's safe for me to do in this case because the archive is replacing the original site, so I don't need to worry about replacing that URL
#
tantek
makes sense
#
aaronpk
hmm i suppose a glob could find a file named styles.css.YMD.hms.css or photo.YMD.hms.png
tantek joined the channel
#
aaronpk
that would slow down access time slightly, but that's not the end of the world
#
aaronpk
also could fix that by making a symlink from .data to .css
JarOfGreen and KevinMarks_ joined the channel
#
aaronpk
i kind of like that, it solves the inspectability issue
eschnou, zero-gravitas, KevinMarks and yakker joined the channel
KevinMarks__ and afrogeek joined the channel
csarven and eschnou joined the channel
#
snarfed
KevinMarks_: looks like more the openstack, docket, etc crowds
j12t, KevinMarks_, frzn_, acegiak and fkooman joined the channel
#
KevinMarks_
Tempted to add utm_source rewriting to autolink
almereyda joined the channel
#
@kevinmarks
@w3cmemes have you considered getting an #indieweb site?
(twitter.com/_/status/593876856799043584)
#
aaronpk
trololol
mlncn, parzzix and j12t joined the channel
#
@dailypia
Nun kämpfe ich schon seit 3 Tagen mit Webmention. Alles klappt, nur die likes und replys werden nicht im Frontend ausgegeben. HEUL!
(twitter.com/_/status/593882321113219074)
j12t, LCyrin, torrorist, tantek, elima, tantek_, lukebrooker, friedcell and wolftune joined the channel
#
Loqi
[mention] Barnaby Walters posted 'So far, a large part of my experimentations with graphical dataflow programming have been using Puredata which, whilst usable for general pr...' linking to http://indiewebcamp.com/https (/articles/how-to-stream-live-audio-over-the-web-using-icecast2-and-puredata/)
friedcell1 joined the channel
#
KevinMarks_
 Last week I wrote about Facebook’s AOL-like dominance and concluded, “What might be the broadband to Facebook’s dial-up?” The answer, I think, is this open Twitter: an identity system for the rest of the web that connects people and apps according to interests, not just superficial relationships, and monetizes accordingly.
#
snarfed
sure, no biggie, throw that together in a weekend or two :P
tantek and KartikPrabhu joined the channel
#
kylewm
it's kind of terrifying to think of twitter as the identity and communication substrate of the internet :p
#
aaronpk
nah don't do that
#
KevinMarks
as I last night described indieweb to people as "like an open twitter" I read that differently
#
GWG
KevinMarks: I thought identi.ca was like an open Twitter
#
KevinMarks
i explained the difference between a monoculture and actually open
#
KartikPrabhu
oh noes! how will I log in using the hotel wifi now?
snarfed joined the channel
#
GWG
I think the Internet could use an https/http 2 push
#
tantek
aaronpk - indeed - mass mozilla dev-platform thread on that with lots of debate from many sides
#
aaronpk
frankly i didn't expect a resolution on that so quickly
#
tantek
there's lots of caveats and to-be-scheduled/scoped in that "resolution"
frzn and snarfed joined the channel
#
@HongPong
even youtube links are getting kicked off of facebook, it's spilled outta control. this is why you need #indiewebcamp design principles
(twitter.com/_/status/593926822909321216)
j12t joined the channel