Actions

Twitter2Mp3

From Mike Beane's Blog

Revision as of 11:30, 24 March 2009 by Rabbi Bob (talk | contribs) (→‎Text)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

I have no idea if this will go anywhere and the concept came from listening to Adam Curry on No Agenda.

Idea

Premise: parse Twitter feed using Perl, utilize Text2Wav (Festival) to convert to a wav file and then LAME encode the result to mp3.

text2wave -F 16000 htt.txt -o test2.wav
lame -s 8 test2.wav test2.mp3

Audio Result

<flashmp3>http://www.rabbibob.com/cstrike/20090324_twitter.mp3%7Cautostart=no%7Cloop=no%7CfirstTrackName=Twitter2Mp3 Example</flashmp3>

Text

  • implemented 'bookmark' - records id from update to file
    • read in bookmark and start from that marker (200 max)
  • replaced http urls with 'weblink'
  • reduced multiple .. to .
  • replaced minor emoticons with words
  • reversed tweet order (now reads oldest -> newest)
  • reports # of tweets since last update (200 max)
  • added extra line break - helps text2wav 'breath' between tweets
There have been 200 tweets since the last update

Tweet 1 pralphman said Checking out some film trailers on apple.com Seems there are some fairly good comedies coming in the next few months.

Tweet 2 Oblivion said Crap. Yesterday was the first day in 42 that I missed doing at least the body test on the #WiiFit. Foo, its disappointed with me.

Tweet 3 dashdrum said @toyotaboy yeah, but my co-workers don't know they aren't clean :-)

Tweet 4 dashdrum said Hot chocolate and pop-tarts.  Breakfast of champions!

Tweet 5 fortheinsane said @dashdrum what flavour pop tarts?

Tweet 6 draithon said @UberSchwarz u may feel like cattle when they herd u on, but hey for the prices they charge I'm not complaining, ok I want more destinations

Tweet 7 dashdrum said @fortheinsane Brown sugar cinnamon.  The best one!

Tweet 8 draithon said I'm overly concerned about a proposal that could cap a persons compensation for their job The interpretations of that law could be disaster

Tweet 9 draithon said America was built on hard work.  If you cap wages, where is motivation going to be.  Hate to say it but greed is a great motivator.

Tweet 10 draithon said I'm not in finance, but I can see this proposal being interpreted & construed against other jobs.  Ultimately hurting the middle class.

Tweet 11 draithon said I usually try to stay away from politics, but the last 18 months - 2 years I have been overly studying & paying better attention to them.

Tweet 12 draithon said lol

Tweet 13 draithon said Dam @twitterfox is getting aggravating.  It submitted the tweet before I was ready to.  Just another thing for this monday I guess

Tweet 14 adamcurry said Preferred format for promos is mp3

Tweet 15 marcapitman posted Great Foxtrot cartoon on Twitter and parenting Weblink  Props to @trwl

Tweet 16 cc_chapman posted SXSW Was a Success For Me -  Weblink 

Tweet 17 tomit posted Have maxed out my Mafia members of Facebook. If anyone wants to play or wants to add me here's the link to my FB page. Weblink 

Tweet 18 tomit posted I will be accepting almost all adds on Facebook for a little while. Weblink 

Tweet 19 thurrott said Good morning. I emerge from my cave this morning with a few chapters done ("Windows 7 Secrets") but hundreds of screenshots yet to take. Joy

Tweet 20 Oblivion said Should shave my head today. Sunburn dictates otherwise.

Tweet 21 adamcurry said Done with accountants. Astoundingly simple. Turns out pysical gold makes life easier in that regard smile

Tweet 22 JohnCleese posted Just had a web minion add me to the Weblink  twitter directory under:  #British #Actor #Writer

Tweet 23 Shultzman said woke up to sunny skies and the sound of birds singing.  I LOVE SPRING!!!

Tweet 24 merlyn said completely off oxycontin; didn't even take any ibuprofen between 9pm and 6am.  hardest part has passed. tomorrow: bandages off!

Fetch Code

#!/usr/bin/perl
use strict;
use warnings;
use Net::Twitter;

my $twit = Net::Twitter->new({username=>"user", password=>"password" });
my $last_id=1366625646;  #prime the last_id for first run

if (-e "bookmark.txt")   #get the last_id
    {
    open BOOKMARK, "<", "bookmark.txt" or die $!;
    while (<BOOKMARK>)
        {
        if ($_>$last_id)   #keep track of the most recent update to dump to bookmark later
            {
            $last_id=$_;
            }
        }
    close BOOKMARK;
    } 

my @timeline = $twit->friends_timeline({count => 200, since_id => $last_id });
my $timeline;
my $i=0;

while ($timeline[0][$i])
    {
    $i++;
    }

print "There have been $i tweets since the last update\n\n";

my $counter=1;

    $i--;

for my $textout (reverse 1 .. $i)
    {
   
    my $location=$timeline[0][$textout]{'user'}{'location'};
    my $id=$timeline[0][$textout]{'id'};
    
    if ($id>$last_id)   #keep track of the most recent update to dump to bookmark later
        {
        $last_id=$id;
        }
    my $screenname=$timeline[0][$textout]{'user'}{'screen_name'};
    my $msg=$timeline[0][$textout]{'text'};
    
    my $updatetype;
    
    if ($msg=~m/http/)
        {
        $updatetype="posted";
        }
    else
        {$updatetype="said"}
    
    
    # Need to develop a scrubber routine that takes an input file
    
    $msg=~s/https?:\/\/([-\w\.]+)+(:\d+)?(\/([\w\/_\.]*(\?\S+)?)?)?/Weblink /g;
    $msg=~s/RT/retweet/g;
    $msg=~s/\.\./\./g;
    $msg=~s/:\)/smile/g;
    $msg=~s/;\)/wink/g;
    $msg=~s/:\(/frown/g;
    
    # Do I want a file list of people to NOT put to the output?    
    print "Tweet $counter $screenname $updatetype $msg\n\n";    
    $counter++;
    }



open BOOKMARK, ">", "bookmark.txt" or die $!;  #set the last_id for the future
print BOOKMARK "$last_id";
close BOOKMARK;

Code Notes

$msg=~s/https?:\/\/([-\w\.]+)+(:\d+)?(\/([\w\/_\.]*(\?\S+)?)?)?/link /g;

Fetch Result

results in (chopped)

$VAR1 = [
          [
            {
              'source' => 'web',
              'favorited' => bless( do{\(my $o = 0)}, 'JSON::PP::Boolean' ),
              'truncated' => $VAR1->[0][0]{'favorited'},
              'created_at' => 'Sat Mar 21 10:27:17 +0000 2009',
              'text' => '@uberschwarz @draithon short HDR bit discussed on TFTTF #346',
              'user' => {
                          'location' => 'Behind you...',
                          'followers_count' => 99,
                          'profile_image_url' => 'http://s3.amazonaws.com/twitter_production/profile_images/105270932/3371878294_7e0723920e_normal.jpg',
                          'protected' => $VAR1->[0][0]{'favorited'},
                          'name' => 'Rabbi Bob',
                          'url' => 'http://www.rabbibob.com/',
                          'id' => 2955291,
                          'description' => 'degradable',
                          'screen_name' => 'rabbibob'
                        },
              'in_reply_to_user_id' => 3708131,
              'id' => 1365343447,
              'in_reply_to_status_id' => undef,
              'in_reply_to_screen_name' => 'UberSchwarz'
            }
      ]
]