-
5 votes
-
Any resources exploring the gap between beginner and top 5% expert in various tech fields?
I am looking for any resource that could tell me how much knowledge and training is needed to go from a beginner to expert, in let's say application software development for example. Or in...
I am looking for any resource that could tell me how much knowledge and training is needed to go from a beginner to expert, in let's say application software development for example. Or in artificial intelligence.
If there isn't any one source, are there any general type of sources I can use to piece together one mega-source?
14 votes -
What editor do you use?
Hey y'all, first time actually posting something here! Just curious what editor people use, whether its for coding, writing, or just the occasional note, whatever. I've gone through most of the...
Hey y'all, first time actually posting something here! Just curious what editor people use, whether its for coding, writing, or just the occasional note, whatever. I've gone through most of the well-known ones (vim, emacs, atom, vs code for starters), but only ever really messed around with vim enough to like it, and I've also been trying out gedit for the last little while and really liking it, but I'm curious to see what other people use!
33 votes -
The Joy of Haxe
6 votes -
Mexican government officials visit Canada to learn about marijuana legalization
10 votes -
Could the display theme be account bound?
Edit: What I'm asking below is actually already an option, I guess I'm just bad at reading :-/ As of right now, when you choose a theme, a theme cookie is created which takes a simple string value...
Edit: What I'm asking below is actually already an option, I guess I'm just bad at reading :-/
As of right now, when you choose a theme, a
theme
cookie is created which takes a simple string value (white
,light
,dark
orblack
).It's straightforward and it works well, but for someone like me who set his browser to delete cookies at the end of his session, it's a little inconvenient to have to go into my settings to set a theme everytime I log on Tildes.
It's a low-priority request, obviously, but maybe you could consider it? I do understand that it makes sense to have it as a cookie since a user may prefer different themes on different devices.
In the meantime I think I'll just write a script to set my theme to black automatically.
8 votes -
Can Hasan Minhaj make topical comedy work on Netflix?
4 votes -
Raleigh Ritchie - Time in a Tree (2018)
4 votes -
United Nations cautions against unilateral recognition of Jerusalem as Israel's capital
13 votes -
Trust no one: How Le Carré's Little Drummer Girl predicted our dangerous world
5 votes -
How Telltale Games went broke in one week
12 votes -
The impact of gratitude on adolescent materialism and generosity
10 votes -
Two orbiters begin their long journey to Mercury Friday night
13 votes -
Not real news: The Associated Press reports on this week's most shared fake news
19 votes -
The suffocation of American democracy
8 votes -
XML Data Munging Problem
Here’s a problem I had to solve at work this week that I enjoyed solving. I think it’s a good programming challenge that will test if you really grok XML. Your input is some XML such as this:...
Here’s a problem I had to solve at work this week that I enjoyed solving. I think it’s a good programming challenge that will test if you really grok XML.
Your input is some XML such as this:
<DOC> <TEXT PARTNO="000"> <TAG ID="3">This</TAG> is <TAG ID="0">some *JUNK* data</TAG> . </TEXT> <TEXT PARTNO="001"> *FOO* Sometimes <TAG ID="1">tags in <TAG ID="0">the data</TAG> are nested</TAG> . </TEXT> <TEXT PARTNO="002"> In addition to <TAG ID="1">nested tags</TAG> , sometimes there is also <TAG ID="2">junk</TAG> we need to ignore . </TEXT> <TEXT PARTNO="003">*BAR*-1 <TAG ID="2">Junk</TAG> is marked by uppercase characters between asterisks and can also optionally be followed by a dash and then one or more digits . *JUNK*-123 </TEXT> <TEXT PARTNO="004"> Note that <TAG ID="4">*this*</TAG> is just emphasized . It's not <TAG ID="2">junk</TAG> ! </TEXT> </DOC>
The above XML has so-called in-line textual annotations because the XML
<TAG>
elements are embedded within the document text itself.Your goal is to convert the in-line XML annotations to so-called stand-off annotations where the text is separated from the annotations and the annotations refer to the text via slicing into the text as a character array with starting and ending character offsets. While in-line annotations are more human-readable, stand-off annotations are equally machine-readable, and stand-off annotations can be modified without changing the document content itself (the text is immutable).
The challenge, then, is to convert to a stand-off JSON format that includes the plain-text of the document and the XML tag annotations grouped by their tag element IDs. In order to preserve the annotation information from the original XML, you must keep track of each
<TAG>
’s starting and ending character offset within the plain-text of the document. The plain-text is defined as the character data in the XML document ignoring any junk. We’ll define junk as one or more uppercase ASCII characters[A-Z]+
between two*
, and optionally a trailing dash-
followed by any number of digits[0-9]+
.Here is the desired JSON output for the above example to test your solution:
{ "data": "\nThis is some data .\n\n\nSometimes tags in the data are nested .\n\n\nIn addition to nested tags , sometimes there is also junk we need to ignore .\n\nJunk is marked by uppercase characters between asterisks and can also optionally be followed by a dash and then one or more digits . \n\nNote that *this* is just emphasized . It's not junk !\n\n", "entities": [ { "id": 0, "mentions": [ { "start": 9, "end": 18, "id": 0, "text": "some data" }, { "start": 41, "end": 49, "id": 0, "text": "the data" } ] }, { "id": 1, "mentions": [ { "start": 33, "end": 60, "id": 1, "text": "tags in the data are nested" }, { "start": 80, "end": 91, "id": 1, "text": "nested tags" } ] }, { "id": 2, "mentions": [ { "start": 118, "end": 122, "id": 2, "text": "junk" }, { "start": 144, "end": 148, "id": 2, "text": "Junk" }, { "start": 326, "end": 330, "id": 2, "text": "junk" } ] }, { "id": 3, "mentions": [ { "start": 1, "end": 5, "id": 3, "text": "This" } ] }, { "id": 4, "mentions": [ { "start": 289, "end": 295, "id": 4, "text": "*this*" } ] } ] }
Python 3 solution here.
If you need a hint, see if you can find an event-based XML parser (or if you’re feeling really motivated, write your own).
4 votes -
Endless Depth - A Fractal Piece
7 votes -
Knuckle Puck - Double Helix (2017)
5 votes -
What we can learn from college football’s very first AP Poll, where Minnesota was No. 1
4 votes -
What I’ve learnt about parenting a queer teen
9 votes -
Funimation, Crunchyroll end content-sharing partnership
13 votes -
The LibriVox free audiobook collection
12 votes -
Jony Ive on the Apple Watch and Big Tech’s responsibilities
5 votes -
Extreme botany: The precarious science of endangered rare plants
7 votes -
Edible cottonseed is now a thing — and it could have big implications for world hunger
8 votes -
Gorky Park - Bang (1989)
4 votes -
Redis 5.0 released
15 votes -
100 Websites That Shaped the Internet as We Know It
9 votes -
What are some current examples of "the emperor's new clothes?"
For those unfamiliar with the story, "The Emperor's New Clothes" is about an emperor who parades around naked, but nobody will point out the obvious for fear of being seen as ignorant....
For those unfamiliar with the story, "The Emperor's New Clothes" is about an emperor who parades around naked, but nobody will point out the obvious for fear of being seen as ignorant. Idiomatically, it refers to something seen as true or widely praised, simply because nobody is willing to speak out against it.
I saw a rant about "blockchains" being the new overhyped hotness for tech companies, and it made me wonder what other "new clothes" are out there right now. What's something you have a strong takedown for that everybody else seems to love/support?
38 votes -
VC folks talk about social media, community, and the failings - includes ex-product head of YouTube
3 votes -
Discord just added a forced-arbitration clause to their Terms of Service (Discord staff response in comments)
39 votes -
The bad behavior of the richest: what I learned from wealth managers
16 votes -
Bomb the Music Industry! - Struggler (2010)
6 votes -
Your Real Biological Clock is You’re Going to Die
9 votes -
Japan's Hometown Tax
10 votes -
How mail-order catalogues subverted the racial hierarchy of Jim Crow
8 votes -
Apple CEO Tim Cook is calling for Bloomberg to retract its Chinese spy chip story
13 votes -
US charges Russian with trying to influence 2018 Midterms
7 votes -
What will be left of the people who make our games?
20 votes -
Halfbakery (is back)
13 votes -
Steam developers speak: Maximum profits for Valve, minimum responsibilities
10 votes -
Google responds to EU by adding a fee to Play Services
18 votes -
How alt-right is Nietzsche really?
6 votes -
Tucker Carlson says he can't go to restaurants anymore
12 votes -
Six red carnations and one severed ram’s head: Deadly threats sent to Russian independent newspaper
6 votes -
What Maniac does (and doesn't) get right about the Bible and the Gnostics
5 votes -
Ubuntu 18.10 released
28 votes -
What have you been listening to this week?
What have you been listening to this week? You don't need to do a 6000 word review if you don't want to, but please write something! Feel free to give recs or discuss anything about each others'...
What have you been listening to this week? You don't need to do a 6000 word review if you don't want to, but please write something!
Feel free to give recs or discuss anything about each others' listening habits.
You can make a chart if you use last.fm:
http://www.tapmusic.net/lastfm/
Remember that linking directly to your image will update with your future listening, make sure to reupload to somewhere like imgur if you'd like it to remain what you have at the time of posting.
16 votes -
Deutschland 86: Spy drama rebooted, braced for communism's collapse
4 votes -
Neither voter suppression nor being called 'coon' and the N-word will stop Black Georgia from voting
7 votes